Article preview
Project Introduction

A real-time streaming digital avatar that holds synchronized audio and video conversations. The results are close to commercial quality. It supports both text and voice interaction and is suited to livestream rooms and interactive exhibition-hall displays.

Features

- Text interaction
- Voice interaction
- SyncTalk project support
- Voice cloning
- Livestream room use
- Exhibition-hall display interaction

Testing

Tested on Ubuntu 18.04 with PyTorch 1.12.1 and CUDA 11.3.

    git clone https://github.com/Hujiazeng/Vach.git
    cd Vach

Dependency Installation

    conda create -n Vach python==3.10
    conda activate Vach
    pip install torch==1.12.1+cu113 torchvision==0.13.1+cu113 torchaudio==0.12.1 --extra-index-url https://download.pytorch.org/whl/cu113
    pip install -r requirements.txt
    pip install --no-index --no-cache-dir pytorch3d -f https://dl.fbaipublicfiles.com/pytorch3d/packaging/wheels/py38_cu113_pyt1121/download.html
    # Note the following modules. If installation is unsuccessful, you can navigate to the path and use pip install . or python setup.py install to compile and install.
    # NeRF/freqencoder
    # NeRF/gridencoder
    # NeRF/raymar
………………………………
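After running the install commands, it can help to confirm that the environment actually holds the pinned versions before compiling the NeRF modules. The following is a minimal sketch rather than part of the Vach project. The helper names (`base_version`, `find_mismatches`, `installed_versions`) are hypothetical, and the pins are copied from the `pip install` line above.

```python
from importlib import metadata

# Versions pinned by the install commands above (assumed to be the intended targets).
EXPECTED = {"torch": "1.12.1", "torchvision": "0.13.1", "torchaudio": "0.12.1"}


def base_version(version):
    """Drop a local build suffix such as '+cu113'."""
    return version.split("+", 1)[0]


def find_mismatches(installed, expected=EXPECTED):
    """Return {package: (expected, found)} for each pin that is not satisfied."""
    problems = {}
    for name, want in expected.items():
        found = installed.get(name)
        if found is None or base_version(found) != want:
            problems[name] = (want, found)
    return problems


def installed_versions(names):
    """Look up installed versions; packages that are not installed map to None."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    problems = find_mismatches(installed_versions(EXPECTED))
    for name, (want, found) in problems.items():
        print(f"{name}: expected {want}, found {found}")
    if not problems:
        print("All pinned packages match.")
```

Run it inside the activated `Vach` environment. It only compares version strings and does not check that CUDA is usable at runtime.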