Model Hosting Container Standards
Python toolkit for standardized model hosting container implementations with Amazon SageMaker integration.
Overview
This repository provides a Python toolkit that integrates TensorRT-LLM and vLLM with the Amazon SageMaker hosting platform for efficient model deployment and inference.
Repository Structure
Quick Start
See the Python README for detailed usage instructions, examples, and development workflow.
Contributing
When contributing to this repository:
python/ directory
Security
See CONTRIBUTING for more information.
License
This project is licensed under the Apache-2.0 License.