My research aims at augmenting human creativity with generative AI. I develop human-centered generative AI technology that can be integrated into professional creative workflows, with a focus on music, audio, and video creation. My long-term goal is to make professional content creation accessible to everyone.
My research interests include music generation, music technology, audio synthesis, video editing, and multimodal AI. Here are the three main pillars of my research:
I develop generative models for music creation where I pioneer the adoption of deep neural networks for generating multi-instrument music. Topics include multitrack music generation (AAAI 2018, ISMIR 2018, ISMIR 2020, ICASSP 2023, ISMIR 2024), text-to-music generation (ISMIR 2025), video-to-music generation (ISMIR 2025), and symbolic music processing tools (ISMIR LBD 2019, ISMIR 2020).
I build AI-assisted music creation tools that aim to augment human creativity in their creative workflow. Topics include expressive violin performance synthesis (ICASSP 2022, ICASSP 2025), music instrumentation (ISMIR 2021), music arrangement (AAAI 2018), and music harmonization (JNMR 2020).
I develop multimodal generative models for content creation that can process, understand and generate data in multiple modalities at the same time. Topics include long-to-short video editing (ICLR 2025, NeurIPS 2025), text-queried sound separation (ICLR 2023), and text-to-audio synthesis (WASPAA 2023).
Currently, I am most interested in multimodal generative AI and human-AI co-creative tools for music, audio, and video creation.
University of California San Diego M.S. in Computer Science
Sep 2019 – Jun 2021
National Taiwan Normal University Digital Video and Audio Arts Program
Sep 2019 – Jun 2017
National Taiwan University B.S. in Electrical Engineering
Sep 2013 – Jun 2017
Professional Experience
NVIDIA Research Intern
Deep Imagination Research Group, NVIDIA Research
Advisors: Siddharth Gururani and Ming-Yu Liu
Topic: Controllable audio generation
Sep 2023 – Dec 2023
Adobe Research Scientist/Engineer Intern
Audio Research Group, Adobe Research
Advisors: Justin Salamon and Oriol Nieto
Topic: Text-to-audio retrieval
May 2023 – Sep 2023
Dolby Speech/Audio Deep Learning Intern
Applied AI Team, Advanced Technology
Advisor: Xiaoyu Liu
Topic: Text-to-audio synthesis
Jan 2023 – Apr 2023
Amazon Applied Scientist Intern
Natural Understanding Team, Alexa AI
Advisors: Wenbo Zhao and Gunnar Sigurdsson
Topic: Text-to-audio synthesis
Sep 2022 – Jan 2023
Sony Student Intern
Tokyo Laboratory 30, R&D Center
Advisor: Naoya Takahashi
Topic: Universal sound separation
May 2022 – Sep 2022
Dolby Deep Learning Audio Intern
Applied AI Team, Advanced Technology Group
Advisor: Cong Zhou
Topic: Music performance synthesis
Jun 2021 – Sep 2021
Yamaha Research Intern
AI Group, Research and Development Division
Advisor: Keijiro Saino
Topic: Deep learning based synthesizer
May 2019 – Aug 2019
Academia Sinica Research Assistant
Music and AI Lab, Research Center for IT Innovation
Advisor: Yi-Hsuan Yang
Topics: Music generation and deep generative models