F5-TTS: An Impressive Open-Source Speech Generation Model from Shanghai Jiao Tong University

GitHubStore · WeChat Official Account · 2024-10-25 09:16

Article preview

Project overview

F5-TTS: a Diffusion Transformer with ConvNeXt V2, with faster training and inference.
E2 TTS: a Flat-UNet Transformer, the closest reproduction of the paper.
Sway Sampling: an inference-time flow-step sampling strategy that greatly improves performance.

Installation

Clone the repository:

    git clone https://github.com/SWivid/F5-TTS.git
    cd F5-TTS

Install torch for your CUDA version, for example:

    pip install torch==2.3.0+cu118 --extra-index-url https://download.pytorch.org/whl/cu118
    pip install torchaudio==2.3.0+cu118 --extra-index-url https://download.pytorch.org/whl/cu118

Install the remaining packages:

    pip install -r requirements.txt

[Optional] A Dockerfile is provided; build it with:

    docker build -t f5tts:v1 .

Development

When opening a pull request, use pre-commit to ensure code quality:

    pip install pre-commit
    pre-commit install

This runs the linters and formatters automatically before each commit.

Run it manually with:

    pre-commit run ………………………………
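To make the Sway Sampling item from the project overview more concrete, here is a minimal sketch of how flow steps could be warped at inference time. It assumes the mapping t = u + s * (cos(pi/2 * u) - 1 + u) as described in the F5-TTS paper, with a coefficient s of -1 as a default; the function names `sway_sampling` and `sway_schedule` are hypothetical and are not taken from the repository.

```python
import math


def sway_sampling(u: float, s: float = -1.0) -> float:
    """Warp a uniform flow step u in [0, 1] with sway coefficient s.

    Assumed formula: t = u + s * (cos(pi/2 * u) - 1 + u).
    With s < 0 the warped steps are pulled toward 0 (the early part
    of the flow); with s = 0 the mapping is the identity.
    """
    if not 0.0 <= u <= 1.0:
        raise ValueError("u must lie in [0, 1]")
    return u + s * (math.cos(math.pi / 2 * u) - 1 + u)


def sway_schedule(num_steps: int, s: float = -1.0) -> list[float]:
    """Return num_steps + 1 warped time points running from 0 to 1."""
    if num_steps < 1:
        raise ValueError("num_steps must be at least 1")
    return [sway_sampling(i / num_steps, s) for i in range(num_steps + 1)]


if __name__ == "__main__":
    # Compare a uniform grid with the warped grid for 8 steps.
    for u, t in zip(sway_schedule(8, s=0.0), sway_schedule(8)):
        print(f"uniform={u:.3f}  sway={t:.3f}")
```

With a negative coefficient, more of the step budget is spent near t = 0, which is one plausible reading of why the strategy helps at low step counts; the endpoints 0 and 1 stay fixed for any s.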
