Currently, virtual humans are widely used in various industries, and the purpose of this project is to help developers quickly migrate virtual human projects based on other GPU platforms to the MetaX GPU platform.
LatentSync is an end-to-end lip-sync method based on audio-conditioned latent diffusion models without any intermediate motion representation, diverging from previous diffusion-based lip-sync methods based on pixel-space diffusion or two-stage generation.
OpenAvatarChat is a modular interactive virtual human dialogue implementation that can run full functionality on a single PC. Currently, it supports MiniCPM-o as a multimodal language model or can use cloud APIs to replace the standard ASR + LLM + TTS implementation.
Virtual Human Models for Metax GPU Platform
Introduction
Currently, virtual humans are widely used in various industries, and the purpose of this project is to help developers quickly migrate virtual human projects based on other GPU platforms to the MetaX GPU platform.
Available Projects
1. Wan2.2-S2V for Metax GPU platform
2. LatentSync for Metax GPU platform
3. CosyVoice for Metax GPU Platform
4. OpenAvatarChat for Metax GPU Platform
Other Projects for Metax GPU Platform
License
This project is released under the MIT. Contributions and usage are warmly welcomed.