Distinguished Seminar Series on Data Science & Artificial Intelligence - "The Power and Vulnerability of Multimodal LLMs" by Dr Liangliang Cao

Research Seminar

Add to Calendar

Date

14 Nov 2025
Organiser

Department of Computing
Time

11:00 - 12:00
Venue

Online via Zoom / PQ306

Speaker

Dr Liangliang Cao

Summary

Modern AI has grown so powerful that it has transformed how humans interact with machines. In the first part of this talk, I will share my experience of this evolution over the past fifteen years—from winning the first ImageNet Challenge in 2010 to recently co-leading Gemini Live at Google DeepMind in 2025—and trace how large-scale data and multimodal learning have driven progress across text, image, and speech domains. In the second part, I will discuss the intrinsic vulnerabilities of these systems, including data and model limitations, as well as the growing challenge of power consumption. I will argue that despite the costly industrial race for ever-larger models, there remain profound research opportunities for academia to make AI more efficient, reliable, and beneficial to human life.

Keynote Speaker

Dr Liangliang Cao

Principal Engineer and Director

Gemini Team

Google

United States

Dr Liangliang Cao received his PhD from the University of Illinois at Urbana–Champaign, his MPhil from the Chinese University of Hong Kong, and his B.S. from the University of Science and Technology of China. After completing his PhD, he joined the IBM Watson Research Center and later co-founded Switi Inc., a startup acquired by Google in 2018. He subsequently founded and led the Google Cloud Speech Modeling team (2018–2021), served as the model DRI for Apple Intelligence (2023–2024), and most recently worked as a Principal Engineer and Director on the Gemini team at Google DeepMind (2024–2025). Before the pandemic, Dr Cao served as an adjunct professor at Columbia University and also at the University of Massachusetts Amherst. He is an IEEE Fellow and also a recipient of the ACM SIGMM Rising Star Award.