Interspeech 2025 Best Student Papers
Winners
"On the Relationship between Accent Strength and Articulatory Features", Kevin Huang, Sean Foley, Jihwan Lee, Yoonjeong Lee, Dani Byrd, Shrikanth Narayanan
"OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning Models", Yifan Peng, Muhammad Shakeel, Yui Sudo, William Chen, Jinchuan Tian, Chyi-Jiunn Lin and Shinji Watanabe
"Attention Models and Auditory Transduction Features for Noise Robustness", Cathal Ó Faoláin and Andrew Hines
Nominees
Piotr Masztalski, Michał Romaniuk, Jakub Żak, Mateusz Matuszewski, Konrad Kowalczyk, "Clustering-based hard negative sampling for supervised contrastive speaker verification".
Junqi Yang, Yuhong Yang, Weiping Tu, Xin Zhao, Cedar Lin, "Band-SCNet: A Causal, Lightweight Model for High-Performance Real-Time Music Source Separation".
Seungu Han, Sungho Lee, Juheon Lee, Kyogu Lee, "Few-step Adversarial Schrödinger Bridge for Generative Speech Enhancement".
Chao Yi-Wen, Yizhou Peng, Dianwen Ng, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma, "A-SMiLE: Affective Sparse Mixture-of-Experts Adapter with Multi-Task Learning for Spoken Dialogue Models".
Tianhua Qi, Shiyan Wang, Cheng Lu, Tengfei Song, Hao Yang, Zhanglin Wu, Wenming Zheng, "PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts".
Yifan Peng, Shakeel Muhammad, Yui Sudo, William Chen, Jinchuan Tian, Chyi-Jiunn Lin, Shinji Watanabe, "OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning".
Minji Ryu, Ji-Hyeon Hur, Sung Heuk Kim, Gahgene Gweon, "Pitch Contour Model (PCM) with Transformer Cross-Attention for Speech Emotion Recognition".
Byeong Hyeon Kim, Hyungseob Lim, Inseon Jang, Hong-Goo Kang, "Towards an Ultra-Low-Delay Neural Audio Coding with Computational Efficiency".
Filippo Villani , Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, "Analysis and Extension of a Near-End Listening Enhancement Method Based on Long-Term Fractile Noise Statistics".
Wang Dong, Jiqing Han, Tieran Zheng, Guibin Zheng, Yongjun He, "Dual Orthogonality Sub-center Loss for Enhanced Anomalous Sound Detection".
Cathal Ó Faoláin, Andrew Hines, "Attention Models and Auditory Transduction Features for Noise Robustness".
Kevin Huang, Sean Foley, Jihwan Lee, Yoonjeong Lee, Dani Byrd, Shrikanth Narayanan, "On the Relationship between Accent Strength and Articulatory Features".
Le Xuan Chan, Annika Heuser, "Relative cue weighting in multilingual stop voicing production".
Jingran Xie, Xiang Li, Hui Wang, Yue Yu, Yang Xiang, Xixin Wu, Zhiyong Wu, "Enhancing Generalization of Speech Large Language Models with Multi-Task Behavior Imitation and Speech-Text Interleaving".
Chetan Sharma, Vaishnavi Chandwanshi, Shreya Shrikant Karkun, Aditya Anand Gupta, Prasanta Kumar Ghosh, "A real-time MRI study on asymmetry in velum dynamics during VCV production with nasal sounds".
Interspeech 2025
PCO: TU Delft Events
Delft University of Technology
Communication Department
Prometheusplein 1
2628 ZC Delft
The Netherlands
Email: pco@interspeech2025.org
X (formerly Twitter): @ISCAInterspeech
Bluesky: @interspeech.bsky.social
Interspeech 2025 is working under the privacy policy of TU Delft
Interspeech 2025
Interspeech 2025interspeech2025@tudelft.nl
Interspeech 2025interspeech2025@tudelft.nlhttps://www.interspeech2025.org
2025-08-17
2025-08-21
OfflineEventAttendanceMode
EventScheduled
Interspeech 2025Interspeech 20250.00EUROnlineOnly2019-01-01T00:00:00Z
To be announcedTo be announced