website banner

Interspeech 2025 Accepted Tutorials​​​​​

Tutorials date: Sunday, August 17th

The schedule for the accepted tutorials for Interspeech 2025 is now live! You may click on the corresponding slot to learn more about the tutorials within that slot. The short version of the schedule is the following:

Slot 1 (8:30-11:30):

  • Option 1: "Speech Technology Meets Early Language Acquisition: How Interdisciplinary Efforts Benefit Both Fields", by Maureen de Seyssel (Apple), Emmanuel Dupoux (École Normale Supérieure, Paris & Meta), Okko Rasanen (Tampere University)

  • Option 2: "Confidence Estimation for Trustworthy and Efficient Speech Systems", by Nagarathna Ravi (Big Data Research and Supercomputing Division, India), Thishyan Raj T (Department of Electrical Engineering, IIT Kanpur), Aditya Raj (Department of Electrical Engineering, IIT Kanpur), Vipul Arora (Department of Electrical Engineering, IIT Kanpur)

  • Option 3: "Extracting insights from your complex data: interpretable statistical methods in speech science", by Tyson Barrett (Utah State University & Highmark Health), Tristan J. Mahr (University of Wisconsin-Madison), Visar Berisha (Arizona State University), Camille Wynne (University of Houston)

Slot 2 (12:00-15:00):

  • Option 1: Invited Tutorial: "Creating sound with Praat", by Paul Boersma (University of Amsterdam), author of Praat

  • Option 2: "A Journey through Emerging Speech Research with NVIDIA NeMo", by Piotr Zelasko (NVIDIA), Nithin Rao Koluguri (NVIDIA), Ante Jukic (NVIDIA), Subhankar Ghosh (NVIDIA), Travis Bartley (NVIDIA), Elena Rastorgueva (NVIDIA), Taejin Park (NVIDIA)

  • Option 3: "Tutorial on Speech Watermarking", by Patrick OReilly (Northwestern University), Bryan Pardo (Northwestern University)

Slot 3 (15:30-18:30):

  • Option 1: "Interpretability Techniques for Speech Models", by Charlotte Pouw (University of Amsterdam), Gaofei Shen (Tilburg University), Martijn Bentum (Radboud University Nijmegen), Marianne de Heer Kloots (University of Amsterdam), Tomas Lentz (Tilburg University), Hosein Mohebbi (Tilburg University), WIllem Zuidema (University of Amsterdam), Grzegorz Chrupała (Tilburg University)

  • Option 2: "Automatic Quality Assessment for Speech and Beyond", by Wen-Chin Huang (Nagoya University), Erica Cooper (NICT), Jiatong Shi (Carnegie Mellon University)

  • Option 3: "Beyond End-to-End ASR: Integrating Long-Context Acoustic and Linguistic Insights", by Taejin Park (NVIDIA), Huck Yang (NVIDIA), Kyu Han (Amazon Web Services), Shinji Watanabe (Carnegie Mellon University)