A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.
libDFcontains Rust code used for data loading and augmentation.
DeepFilterNetcontains Python code including a libDF wrapper for data loading, DeepFilterNet training, testing and visualization.
modelscontains DeepFilterNet model weights and config.
System requirements are
pip (Rust and Python package managers). Usage of a
virtualenv recommended. This framework is currently only tested under Linux.
Installation of python dependencies and libDF:
cd path/to/DeepFilterNet/ # cd into repository # Recommended: Install or activate a python env. pip install maturin # Used to compile libDF and load maturin build --release -m DeepFilterNet/Cargo.toml # Build python wheel # Install python wheel. Make sure to specify the correct DeepFilterNet and python version pip install target/wheels/DeepFilterNet-0.1.0-cp39-cp39-linux_x86_64.whl # Optional: Install cuda version of pytorch from pytorch.org pip install -r requirements.txt # Install remaining dependencies
To enhance noisy audio files using DeepFilterNet run
# usage: enhance.py [-h] [--output-dir OUTPUT_DIR] model_base_dir noisy_audio_files [noisy_audio_files ...] python DeepFilterNet/df/enhance.py models/DeepFilterNet/ path/to/noisy_audio.wav
DeepFilterNet is free and open source! All code in this repository is dual-licensed under either:
- MIT License (LICENSE-MIT or http://opensource.org/licenses/MIT)
- Apache License, Version 2.0 ( or http://www.apache.org/licenses/LICENSE-2.0)
at your option. This means you can select the license you prefer!
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.