目录

INSPIRE: Instruction-based Multi-Task Speech and Audio Processing Benchmark

Introduction

INSPIRE is an INstruction-based multi-task SPeech and audIo pRocessing bEnchmark. INSPIRE is built to help benchmark speech foundation models and it includes dataset and models. INSPIRE can be used for cross-modal tasks including speech-to-text, text-to-speech, speech-to-speech, and audio-to-text tasks in the range from recognition, understanding and generation.

Dataset

  • INSPIRE dataset (coming soon)

Models

  • (coming soon)
    ## License This project is licensed under [The MIT License](https://opensource.org/licenses/MIT). INSPIRE also contains various third-party components and some code modified from other repos under other open source licenses.
关于
29.0 KB
邀请码
    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802032778号