I am a computer science PhD student at Mila and Concordia University, supervised by Professor Mirco Ravanelli. I have a broad interest in deep learning for Conversational AI. My research focuses on discrete self-supervised learning for speech and audio, exploring its potential to bridge audio and language models. I am also one of the main contributors to the SpeechBrain project, a popular open-source conversational AI toolkit.
We are excited to announce the launch of our DASB leaderboard!
Excited to annouce our paper"How Should We Extract Discrete Audio Tokens from Self-Supervised Models?" has been accepted at Interspeech 2024 for an oral presentation.
Happy to share a preprint from our recent work "DASB - Discrete Audio and Speech Benchmark". Thanks to all my amazing collaborators.
Mila/Concordia University (Gina Cody School of Engineering and Computer Science)Sep. 2022 - present
PhD in Computer Science
University of Texas at Dallas (UTD)2018 - 2021
M.S. in Computer Science
Most recent publications on Google Scholar.
‡ indicates equal contribution.
How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Pooneh Mousavi, Jarod Duret and , Salah Zaiem, Luca Della Libera, Artem Ploujnikov, Cem Subakan, Mirco Ravanelli
Proc. of Interspeech, 2024, Oral Session
DASB - Discrete Audio and Speech Benchmark.
Pooneh Mousavi, Luca Della Libera, Jarod Duret, Artem Ploujnikov, Cem Subakan, Mirco Ravanell
Submitted to NeurIPS 2024 | Track Datasets and Benchmarks Submission
Open-Source Conversational AI with SpeechBrain 1.0
Mirco Ravanelli, Titouan Parcollet, et al
Submitted to JMLR (Machine Learning Open Source Software)
CL-MASR: A Continual Learning Benchmark for Multilingual ASR
Luca Della Libera‡, Pooneh Mousavi‡, Salah Zaiem, Cem Subakan, Mirco Ravanelli
Submitted to Transactions on Audio, Speech and Language Processing
WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series
Jean-Christophe Gagnon-Audet, Kartik Ahuja, Mohammad Javad Darvishi Bayazi, Pooneh Mousavi, Guillaume Dumas, Irina Rish
Transactions on Machine Learning Research
Detecting Hashtag Hijacking for Hashtag Activism
Pooneh Mousavi, Jessica Ouyang
ACL | IJCNLP | NLP4PosImpact
Please Donate for the Affected:Supporting Emergency Managers in Finding Volunteers and Donations in Twitter Across Disasters
Pooneh Mousavi, Cody Buntain
ISCRAM 2022
How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
Pooneh Mousavi, Jarod Duret and , Salah Zaiem, Luca Della Libera, Artem Ploujnikov, Cem Subakan, Mirco Ravanelli
Proc. of Interspeech, 2024, Oral Session
DASB - Discrete Audio and Speech Benchmark.
Pooneh Mousavi, Luca Della Libera, Jarod Duret, Artem Ploujnikov, Cem Subakan, Mirco Ravanell
Submitted to NeurIPS 2024 | Track Datasets and Benchmarks Submission
Open-Source Conversational AI with SpeechBrain 1.0
Mirco Ravanelli, Titouan Parcollet, et al
Submitted to JMLR (Machine Learning Open Source Software)
CL-MASR: A Continual Learning Benchmark for Multilingual ASR
Luca Della Libera‡, Pooneh Mousavi‡, Salah Zaiem, Cem Subakan, Mirco Ravanelli
Submitted to Transactions on Audio, Speech and Language Processing
WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series
Jean-Christophe Gagnon-Audet, Kartik Ahuja, Mohammad Javad Darvishi Bayazi, Pooneh Mousavi, Guillaume Dumas, Irina Rish
Transactions on Machine Learning Research
Detecting Hashtag Hijacking for Hashtag Activism
Pooneh Mousavi, Jessica Ouyang
ACL | IJCNLP | NLP4PosImpact
Please Donate for the Affected:Supporting Emergency Managers in Finding Volunteers and Donations in Twitter Across Disasters
Pooneh Mousavi, Cody Buntain
ISCRAM 2022
Conversational AI, Concordia University
Winter 2023, Winter 2024, See info here
Full Resume in PDF.