Discrete Audio and Speech Benchmark
Benchmark for a diverse set of discrete audio encoders from all three categories: semantic, compression, and hybrid.
Discrete Audio and Speech Benchmark
DASB is a benchmark for assessing discrete audio tokens across various tasks. It includes different evaluation metrics, downstream architectures, and bitrates for thorough comparisons. The system also features an automated pipeline for dataset downloading, dataloading, evaluation, and leaderboard submission. DASB evolves based on community feedback. To contribute your audio tokenizer or report issues, please email us or visit our GitHub page.
