DASB | Documentation for the DASB Benchmark.

Discrete Audio and Speech Benchmark

DASB is a benchmark for assessing discrete audio tokens across various tasks. It includes different evaluation metrics, downstream architectures, and bitrates for thorough comparisons. The system also features an automated pipeline for dataset downloading, dataloading, evaluation, and leaderboard submission. DASB evolves based on community feedback. To contribute your audio tokenizer or report issues, please email us or visit our GitHub page.

Diverse Tasks

We consider a wide range of discriminative tasks and generative from all three speech, music and general sound domains.

Multiple Tokenizer

It supports a range of discrete audio encoders across three categories: semantic, acoustic, and hybrid.

Unified Evaluation

DASB is a modular code repository built on the `SpeechBrain`toolkit to ensure reproducible and standardized evaluations.