目录

Model Hosting Container Standards

Python toolkit for standardized model hosting container implementations with Amazon SageMaker integration.

Overview

This repository provides a Python toolkit that enables TensorRT-LLM and vLLM integration with Amazon SageMaker hosting platform for efficient model deployment and inference.

Repository Structure

ModelHostingContainerStandards/
├── python/                    # Python implementation
│   ├── model_hosting_container_standards/  # Main Python package
│   │   ├── __init__.py
│   │   ├── config.py
│   │   ├── logging_config.py
│   │   ├── utils.py
│   │   ├── common/            # Common utilities
│   │   │   ├── fastapi/       # FastAPI integration
│   │   │   ├── custom_code_ref_resolver/  # Dynamic code loading
│   │   │   └── handler/       # Handler specifications
│   │   └── sagemaker/         # SageMaker integration
│   │       ├── lora/          # LoRA adapter support
│   │       └── sessions/      # Stateful session management
│   ├── tests/                 # Package tests
│   ├── examples/              # Python examples and demos
│   ├── pyproject.toml         # Python project configuration
│   ├── Makefile               # Build automation
│   └── README.md              # Python-specific documentation
├── docs/                      # Documentation
├── examples/                  # Top-level examples
├── .github/                   # GitHub templates and workflows
├── Config                     # Shared configuration files
└── README.md                  # This file

Quick Start

cd python
poetry install
poetry shell

See the Python README for detailed usage instructions, examples, and development workflow.

Contributing

When contributing to this repository:

  1. Place Python-specific code in the python/ directory
  2. Follow the established patterns for project structure
  3. Include tests for new functionality
  4. Update documentation as needed
  5. Run pre-commit hooks to ensure code quality

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

邀请码
    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802032778号