Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis

Aug 11, 2021

TalkNet 2 [WIP]

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
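To give a rough sense of why depth-wise separable convolutions keep the model small, the sketch below compares the weight count of a standard 1D convolution with that of a depth-wise convolution followed by a 1x1 point-wise convolution. The function names and the channel and kernel sizes are illustrative assumptions, not values taken from the TalkNet 2 configuration.

```python
def conv1d_params(in_channels, out_channels, kernel_size, bias=False):
    """Parameter count of a standard 1D convolution."""
    n = in_channels * out_channels * kernel_size
    return n + (out_channels if bias else 0)


def separable_conv1d_params(in_channels, out_channels, kernel_size, bias=False):
    """Parameter count of a depth-wise conv (one filter per input channel)
    followed by a 1x1 point-wise conv that mixes channels."""
    depthwise = in_channels * kernel_size
    pointwise = in_channels * out_channels
    n = depthwise + pointwise
    return n + ((in_channels + out_channels) if bias else 0)


# Illustrative sizes only (assumed, not from the TalkNet 2 paper or repo).
standard = conv1d_params(256, 256, 5)
separable = separable_conv1d_params(256, 256, 5)
print(f"standard: {standard}, separable: {separable}")
```

With these assumed sizes the separable layer needs roughly a fifth of the weights, and the gap widens as the kernel grows.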

Official TalkNet 2 repo here

Remaining work:

Add masking to all QuartzNet Blocks.
Add PostNet to Mel-Spectrogram generator.
Clean up and revise all model implementations to follow best practices.
Add Text and Audio processing code.
Add dataloader and training code.
Test the whole TalkNet 2 setup and post results.
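The masking item presumably refers to zeroing padded time steps so that convolutions over a padded batch do not mix padding into valid frames. The sketch below shows the idea in plain Python on nested lists, assuming a `[batch][channel][time]` layout and known sequence lengths. The helper names `lengths_to_mask` and `apply_mask` are hypothetical; the repository itself would do this on PyTorch tensors.

```python
def lengths_to_mask(lengths, max_len=None):
    """Build one 0/1 mask per sequence: 1 for valid time steps, 0 for padding."""
    if max_len is None:
        max_len = max(lengths)
    return [[1 if t < n else 0 for t in range(max_len)] for n in lengths]


def apply_mask(batch, mask):
    """Zero every padded time step in a [batch][channel][time] nested list."""
    return [
        [[x * m for x, m in zip(channel, row_mask)] for channel in sample]
        for sample, row_mask in zip(batch, mask)
    ]


# Two sequences padded to length 3; the second has only one valid frame.
lengths = [3, 1]
batch = [
    [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]],
    [[7.0, 9.0, 9.0], [8.0, 9.0, 9.0]],  # the 9.0 entries stand in for padding
]
mask = lengths_to_mask(lengths)
masked = apply_mask(batch, mask)
print(masked[1])
```

In a convolutional block, a mask like this would typically be applied to the input of each convolution, so padded positions contribute nothing to neighbouring valid frames.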

Citation:

@misc{beliaev2021talknet,
      title={TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction},
      author={Stanislav Beliaev and Boris Ginsburg},
      year={2021},
      eprint={2104.08189},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}

GitHub

https://github.com/rishikksh20/TalkNet2-pytorch
