This repository is the official implementation of "MusCaps: Generating Captions for Music Audio" (IJCNN 2021).
This repository is a proof of principle for performing Molecular Dynamics analysis, in this case with the program VMD, via natural language commands.
This repo contains the PyTorch implementation for paper "The Boombox: Visual Reconstruction from Acoustic Vibrations".
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
PyTorch implementation for the paper "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis"
AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations.
PyAbsorp is a python module that has the main focus to help estimate the Sound Absorption Coefficient.
A GUI-based audio player with support for a large variety of formats, able to play from web-hosted media platforms such as YouTube
Pydiogment aims to simplify audio augmentation. It generates multiple audio files based on a starting mono audio file.
Note length, tempo, velocity, progression and other stuff are minor worries in comparison. Those things can be felt, without learning.
tinytag is a library for reading music meta data of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA and Wave files with python.
TimeSide is a python framework enabling low and high level audio analysis, imaging, transcoding, streaming and labelling.
Tap into The Echo Nest's Musical Brain for the best music search, information, recommendations and remix tools on the web.
mingus is a package for Python used by programmers, musicians, composers and researchers to make and analyse music.
eyeD3 is a Python tool for working with audio files, specifically MP3 files containing ID3 metadata (i.e. song info).
django-elastic-transcoder is an Django app, let you integrate AWS Elastic Transcoder in Django easily.