Skip to main content
Start main content

Distinguished Lectures in Humanities: Attentive Listening by Humans and Machines

Distinguished Lectures in Humanities

20251104_1000x540
  • Date

    04 Nov 2025

  • Organiser

    Faculty of Humanities

  • Time

    11:00 - 12:30

  • Venue

    UG05, PolyU HHB Campus & Zoom  

Remarks

The talk will be conducted in English.

Summary

Abstract

Humans have a remarkable ability to pay their auditory attention only to a sound source of interest, that we call selective auditory attention, in a multi-talker environment or a Cocktail Party. As discovered in neuroscience and psychoacoustics, the auditory attention is achieved by a modulation of top-down and bottom-up attention. However, signal processing approach to speech separation and/or speaker extraction from multi-talker speech remains a challenge for machines. In this talk, we study the deep learning solutions to monaural speech separation and speaker extraction that enable selective auditory attention. We review the findings from human audio-visual speech perception to motivate the design of speech perception algorithms. We will also discuss the computational auditory models, technical challenges and the recent advances in the field.


About the speaker

Professor Haizhou LI is a Fellow of the Singapore Academy of Engineering, a Fellow of the IEEE, and a Fellow of International Speech Communication Association. He is currently the Dean of the School of Artificial Intelligence and Presidential Chair Professor at The Chinese University of Hong Kong, Shenzhen. He also serves as Adjunct Professor at the National University of Singapore and U Bremen Excellence Chair Professor at the University of Bremen, Germany.

Professor LI has made outstanding contributions to speech recognition and natural language processing. He has led the development of multiple major technology deployments, including the voiceprint recognition engine for Lenovo’s A586 smartphone in 2012, and the music search engine for Baidu Music in 2013. He was the Editor-in-Chief of IEEE/ACM Transactions on Audio, Speech and Language Processing, the President of International Speech Communication Association, and a Vice President of IEEE Signal Processing Society.


POSTER_Prof LI Haizhou  

Your browser is not the latest version. If you continue to browse our website, Some pages may not function properly.

You are recommended to upgrade to a newer version or switch to a different browser. A list of the web browsers that we support can be found here