A minimal code for fairseq vq-wav2vec model inference

Dec 14, 2021 1 min read

vq-wav2vec inference

A minimal code for inference. Runs without installing the fairseq toolkit and its dependencies.

Usage example:

import torch
import fairseq
from models.wav2vec import Wav2VecModel

cp = torch.load('/path/to/vq-wav2vec.pt')
model = Wav2VecModel.build_model(cp['args'], task=None)
model.load_state_dict(cp['model'])
model.eval()

wav_input_16khz = torch.randn(1,10000)
z = model.feature_extractor(wav_input_16khz)
print(z[0].T.detach().numpy().shape) # output: (60, 512)
_, idxs = model.vector_quantizer.forward_idx(z)
print(idxs.shape) # output: torch.Size([1, 60, 2]), 60 timesteps with 2 indexes corresponding to 2 groups in the model

GitHub

View Github

Toolkit

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

A minimal code for fairseq vq-wav2vec model inference

vq-wav2vec inference

Usage example:

GitHub

John

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

Wrapper for wttr.in weather forecast

vq-wav2vec inference

Usage example:

GitHub

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

Wrapper for wttr.in weather forecast

You might also like...