Installation

Requirements

  • Python ≥ 3.12

  • uv (recommended) or pip

Clone and install

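git clone <repo-url>
cd speech_enhancement

# With uv (recommended). uv pip installs into a virtual environment, so create one first.
uv venv
uv pip install -e .

# Or with pip inside a virtual environment
python -m venv .venv
source .venv/bin/activate   # Windows: .venv\Scripts\activate
pip install -e .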

Optional documentation dependencies

To build these docs locally, install Sphinx and the Furo theme:

uv pip install sphinx furo

Then from the docs/ directory:

make html
xdg-open _build/html/index.html   # Linux
open _build/html/index.html        # macOS

Data

Download the datasets and place them under data/ in the repository root.

LibriSpeech (train-clean-100 for training, test-clean for testing):

data/
├── train-clean-100/LibriSpeech/train-clean-100/<speaker>/<chapter>/*.flac
└── test-clean/LibriSpeech/test-clean/<speaker>/<chapter>/*.flac

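Download train-clean-100.tar.gz (~6.3 GB) and test-clean.tar.gz from https://www.openslr.org/12. Each archive unpacks to a top-level LibriSpeech/ directory, so extract it inside data/train-clean-100/ or data/test-clean/ respectively to obtain the layout above.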
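
As a quick sanity check after extraction, the sketch below counts the .flac files found under each subset, assuming the directory layout shown above. The helper name count_flac_files and the default data root "data" are illustrative assumptions, not part of this package.

```python
from pathlib import Path


def count_flac_files(data_root, subsets=("train-clean-100", "test-clean")):
    """Count .flac files per subset, assuming the layout shown above.

    Hypothetical helper for illustration; it is not part of this package.
    """
    counts = {}
    for subset in subsets:
        # Layout: <data_root>/<subset>/LibriSpeech/<subset>/<speaker>/<chapter>/*.flac
        subset_dir = Path(data_root) / subset / "LibriSpeech" / subset
        if not subset_dir.is_dir():
            # A missing subset directory is reported as zero files.
            counts[subset] = 0
            continue
        counts[subset] = sum(1 for _ in subset_dir.glob("*/*/*.flac"))
    return counts


if __name__ == "__main__":
    # Run from the repository root so that "data" resolves correctly.
    for subset, n in count_flac_files("data").items():
        status = "ok" if n > 0 else "missing or empty"
        print(f"{subset}: {n} .flac files ({status})")
```

A count of zero for a subset usually means the archive was extracted one level too high or too low relative to the expected path.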

DEMAND noise corpus:

data/noise/DEMAND/<ENVIRONMENT_16k>/ch01.wav

Download the 16 kHz archives from https://zenodo.org/record/1227121 and extract each one under data/noise/DEMAND/, so that every environment has its own folder as in the path above.
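
To confirm that the noise corpus landed in the expected place, a minimal sketch like the following lists the environment folders that contain a ch01.wav file. The helper name find_demand_environments is a hypothetical illustration and assumes the path pattern shown above.

```python
from pathlib import Path


def find_demand_environments(demand_root, channel="ch01.wav"):
    """Return sorted names of environment folders containing the channel file.

    Hypothetical helper for illustration; it is not part of this package.
    """
    root = Path(demand_root)
    if not root.is_dir():
        # Nothing extracted yet (or wrong path): report no environments.
        return []
    return sorted(
        env.name
        for env in root.iterdir()
        if env.is_dir() and (env / channel).is_file()
    )


if __name__ == "__main__":
    # Run from the repository root so that the relative path resolves.
    environments = find_demand_environments("data/noise/DEMAND")
    print(f"Found {len(environments)} DEMAND environments: {environments}")
```

An empty result suggests the archives were extracted into a different directory or with an extra level of nesting.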