
VITS Chinese, TTS Chinese, TTS Mandarin

Feb 08, 2022

Chinese TTS implemented with VITS

This is a copy of https://github.com/jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

ESPnet link: github.com/espnet/espnet/tree/master/espnet2/gan_tts/vits

coqui-ai/TTS link: github.com/coqui-ai/TTS/tree/main/recipes/ljspeech/vits_tts

If there is any infringement, please contact me and I will delete the project.

Notes on building a 16 kHz Baker TTS with VITS

apt-get install espeak

pip install -r requirements.txt

cd monotonic_align

python setup.py build_ext --inplace

Copy the 16 kHz Baker audio into ./baker_waves/ and start training

python train.py -c configs/baker_base.json -m baker_base

With two 1080 GPUs, about two days of training gives a basically usable model.
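Since a run takes days, it may be worth confirming beforehand that every file in ./baker_waves/ really is 16 kHz. The following is a minimal sketch using only the Python standard library. The helper names `wav_sample_rate` and `find_mismatched_wavs` are hypothetical and not part of this repository, and the sketch assumes the files are plain PCM WAV, which is all the `wave` module reads.

```python
import wave
from pathlib import Path


def wav_sample_rate(path):
    """Return the sample rate (Hz) stored in a PCM WAV file header."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getframerate()


def find_mismatched_wavs(wave_dir, expected_rate=16000):
    """List (file name, sample rate) for WAV files not at expected_rate."""
    mismatched = []
    for wav_path in sorted(Path(wave_dir).glob("*.wav")):
        rate = wav_sample_rate(wav_path)
        if rate != expected_rate:
            mismatched.append((wav_path.name, rate))
    return mismatched


if __name__ == "__main__":
    # "./baker_waves" is the directory named in the steps above.
    for name, rate in find_mismatched_wavs("./baker_waves"):
        print(f"{name}: {rate} Hz (expected 16000 Hz)")
```

If the script prints nothing, no file with a different sample rate was found.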

Testing

python vits_strings.py

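The model trained as above has a problem with noticeable pauses.

Causes:

1. Boundaries are already forcibly inserted after the phonemes, and VITS then inserts boundaries again. The relevant config parameter is "add_blank": true

2. The stochastic duration predictor may also have an effect. The relevant config parameter is use_sdp=True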
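The first cause can be illustrated with a small sketch. In the upstream VITS code, `add_blank` interleaves a blank token around every input symbol. The `intersperse` function below mirrors that helper, but whether this copy behaves identically is an assumption. The token IDs are hypothetical and chosen only for illustration.

```python
def intersperse(tokens, blank):
    """Place `blank` before, between, and after every item in `tokens`."""
    result = [blank] * (len(tokens) * 2 + 1)
    result[1::2] = tokens
    return result


# Hypothetical token IDs, for illustration only: 0 is the blank, 1 is a
# boundary marker already inserted by the text front end, 11..14 are phonemes.
BLANK, BOUNDARY = 0, 1
with_boundaries = [11, BOUNDARY, 12, BOUNDARY, 13, BOUNDARY, 14, BOUNDARY]

after_add_blank = intersperse(with_boundaries, BLANK)
print(after_add_blank)
# [0, 11, 0, 1, 0, 12, 0, 1, 0, 13, 0, 1, 0, 14, 0, 1, 0]

# Count how many tokens in the final sequence carry no phoneme content.
non_phoneme = sum(1 for t in after_add_blank if t in (BLANK, BOUNDARY))
print(f"{non_phoneme} of {len(after_add_blank)} tokens are blanks or boundaries")
```

In this toy sequence each phoneme ends up followed by a blank, a boundary, and another blank, which matches the double insertion described above. To test either cause, one could retrain with "add_blank": false or with use_sdp disabled and compare the pauses. The notes here do not report having done so.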

Sample audio

vits_样本.wav
