Discrete Audio and Speech Benchmark

Benchmark for a diverse set of discrete audio encoders from all three categories: semantic, acoustic, and hybrid.

It covers three domains: speech, music, and general sound.

DASB is a benchmark for assessing discrete audio tokens across various tasks. It includes different evaluation metrics, downstream architectures, and bitrates for thorough comparisons. It also features an automated pipeline for dataset downloading, data loading, evaluation, and leaderboard submission. DASB evolves based on community feedback. To contribute your audio tokenizer or report issues, please email us or visit our GitHub page.

Diverse Tasks

We consider a wide range of discriminative and generative tasks across all three domains: speech, music, and general sound.

Multiple Tokenizers

DASB supports a range of discrete audio encoders across three categories: semantic, acoustic, and hybrid.

Unified Evaluation

DASB is a modular code repository built on the SpeechBrain toolkit to ensure reproducible and standardized evaluations.
