SMM26

Keynote Speakers

Title:

AI & The Sound of Mental Health: Remixed Feelings

Abstract:

Mental health has a sound, and AI is beginning to hear and mix it. In the voice and language, we find traces of affect, depression, and recovery that can open new pathways for scalable and personalised care. Starting there, we will explore how speech-based digital psychology is expanding from diagnosis toward intervention: from vocal and linguistic biomarkers of mental well-being to large language models that support rather than simply assess, reaching to generative music systems that enable closed-loop personalised emotional regulation. This convergence of speech AI, interventive language technology, and generative audio suggests a future in which intelligent systems can listen, understand, and respond in psychologically meaningful ways. Realising that future, however, requires more than technical performance. It demands beyond clinical relevance reliability, safety, explainability, and trust. I will discuss the promise, route, and responsibility of building AI that does not merely analyse the mind, but helps care and deejay for it.

Industrial Speaker

Dr. Yuki Mitsufuji

Title:

AI for Creators: Pushing Creative Abilities to the Next Level

Abstract:

This talk explores how cutting-edge generative AI is transforming creative workflows in music, cinema, and gaming. Led by Dr. Yuki Mitsufuji, the Music Foundation Model Team at Sony AI has developed multimodal frameworks such as MMAudio, which generate high-quality, synchronized audio from video and text inputs. Their research, recognized at top venues like NeurIPS, ICLR, and CVPR, has contributed to both content creation and protection, with practical demos integrated into commercial products. The session will highlight key innovations, including sound restoration projects and the future of AI-powered media production.

Keynote Speakers

Professor. Björn Schuller

Title:

Abstract:

Industrial Speaker

Dr. Yuki Mitsufuji

Title:

Abstract: