目录
Peng Lin5个月前1次提交

Virtual Human Models for Metax GPU Platform

logo

Introduction

Currently, virtual humans are widely used in various industries, and the purpose of this project is to help developers quickly migrate virtual human projects based on other GPU platforms to the MetaX GPU platform.

Available Projects

1. Wan2.2-S2V for Metax GPU platform

  • Wan2.2-S2V is an audio-driven cinematic video generation model.
Product Image VirtualHuman Image VirtualHuman Video

2. LatentSync for Metax GPU platform

  • LatentSync is an end-to-end lip-sync method based on audio-conditioned latent diffusion models without any intermediate motion representation, diverging from previous diffusion-based lip-sync methods based on pixel-space diffusion or two-stage generation.
Original Video Translated Video

3. CosyVoice for Metax GPU Platform

  • CosyVoice is a powerful voice generation model, which supports multiple languages, with fast and stable generation.
Input Voice Output Voice

4. OpenAvatarChat for Metax GPU Platform

  • OpenAvatarChat is a modular interactive virtual human dialogue implementation that can run full functionality on a single PC. Currently, it supports MiniCPM-o as a multimodal language model or can use cloud APIs to replace the standard ASR + LLM + TTS implementation.
Demo Video

Other Projects for Metax GPU Platform

License

This project is released under the MIT. Contributions and usage are warmly welcomed.

关于
2.9 MB
邀请码
    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802047560号