Implementation of DeepSpeech2 for PyTorch using PyTorch Lightning
Codename generator using WordNet parts of speech database
A port of Coqui STT based on DeepSpeech to PyTorch
PIKA: a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement
AutoSub is a CLI application to generate subtitle file (.srt) for any video file using Mozilla DeepSpeech.
The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generate far-field
Free medium-quality text-to-speech software, VOICEVOX speech synthesis engine
We examine the use of the Conformer architecture for continuous speech separation. Conformer allows the separation model to efficiently capture both local and global context information, which is helpful for speech separation.
This repository provides official implementation of deep Gaussian process (DGP)-based multi-speaker speech synthesis with PyTorch.