Dr. Shuai Wang is currently a Tenure-Track Associate Professor at the School of Intelligence Science and Technology, Nanjing University. He earned his B.E. degree from Northwestern Polytechnical University in 2014 under the supervision of Prof. Lei Xie, and his Ph.D. degree from Shanghai Jiao Tong University in 2020 under the supervision of Prof. Kai Yu and Prof. Yanmin Qian. Prior to joining Nanjing University, he served as a research scientist in Prof. Haizhou Li's team at the Shenzhen Research Institute of Big Data, Chinese University of Hong Kong (Shenzhen), where he still holds an adjunct position. Additionally, he spent 2.5 years as a senior research scientist at Lightspeed & Quantum Studios, Tencent, where he led the speech group in R&D of speech technologies customized for games.

His research interests include speaker modeling, target speaker processing, speech synthesis, voice conversion, and music generation. He has published more than 60 papers in top-tier speech conferences and journals.

🔥 Openings

I will have several openings for graduate students (Fall 2026) and will update the details as soon as possible. I am currently looking for research assistants; please feel free to email me your CV if you are interested in any of the following topics:

  • Speaker Modeling
  • Target Speaker Processing
  • Speech Generation
  • Music Generation
  • Brain-inspired speech processing

Note that research assistants can choose to work at either Nanjing University (Suzhou) or the Chinese University of Hong Kong (Shenzhen), jointly supervised with Prof. Haizhou Li.

👨‍🎓 Students

Ph.D. students jointly supervised with Prof. Haizhou Li

  • Chenyu Yang, CUHK-Shenzhen, Music Generation, Intern at Tencent AI Lab (Rhino-Bird Talent Program)
  • Zhijun Liu, CUHK-Shenzhen, Speech Synthesis, Intern at NetEase and ByteDance (TopSeed)
  • Sho Inoue, CUHK-Shenzhen, Speech Synthesis, Intern at NetEase and Meta FAIR
  • Qibing Bai, CUHK-Shenzhen, Accent Conversion, Intern at Tencent TEA-Lab
  • Wenxuan Wu, CUHK, Target Speech Extraction
  • Wupeng Wang, NUS, Speech Separation

Past students

  • Junjie Li, currently a Ph.D. student at The Hong Kong Polytechnic University

📝 Publications

Please check my Google Scholar for the latest publications.

🪜 Open-Source Projects

  • WeSpeaker: Speaker Embedding Learning
  • WeSep: Target Speaker Extraction
  • DiffRhythm: Diffusion-based Rhythmic Music Generation

🎖 Honors and Awards

  • 2024 Best Paper Award, ISCSLP 2024
  • 2024 Best Student Paper Award, ISCSLP 2024
  • 2019 VoxSRC 2019: Ranked 1st in both tracks
  • 2019 DIHARD 2019: Ranked 1st in all 4 tracks
  • 2018 IEEE Ganesh N. Ramaswamy Memorial Student Grant Award

🌅 Services

I serve as a regular reviewer for multiple conferences and journals, including

  • ICASSP, Interspeech, ASRU, SLT, T-ASLP, Computer Speech & Language, Speech Communication;
  • ICML, NeurIPS, AAAI, ACM MM.

I have served as the Special Session Chair of APSIPA 2025, the Operation Chair of the ICASSP 2025 Suzhou Satellite Event, and the Publication Chair of SLT 2024.

💬 Invited Talks