MARS5 Speech Model (TTS) from CAMB.AI
This is the repo for the MARS5 English speech model (TTS) from CAMB.AI.
The model follows a two-stage AR-NAR (autoregressive, then non-autoregressive) pipeline with a novel NAR component (see the Architecture section for details).
Given just 5 seconds of reference audio and a snippet of text, MARS5 can generate speech even for prosodically hard and diverse scenarios such as sports commentary and anime.
Official website: www.camb.ai
Source code: https://github.com/camb-ai/mars5-tts