See also my Google Scholar.
Video-Guided Text-to-Music Generation Using Public Domain Movie Collections
Haven Kim, Zachary Novack, Weihan Xu, Julian McAuley, and Hao-Wen Dong
International Society for Music Information Retrieval Conference (ISMIR), 2025
paper
demo
code
poster
reviews
Music Generation Multimodal Learning
Synthesizing Composite Hierarchical Structure from Symbolic Music Corpora
Ilana Shapiro, Ruanqianqian Huang, Zachary Novack, Cheng-I Wang, Hao-Wen Dong, Taylor Berg-Kirkpatrick, Shlomo Dubnov, and Sorin Lerner
International Joint Conference on Artificial Intelligence (IJCAI), 2025
paper
code
ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning
Daewoong Kim, Hao-Wen Dong, and Dasaem Jeon
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
paper
demo
code
reviews
Audio Synthesis Music Performance Rendering
Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation
Jiwoo Ryu, Hao-Wen Dong, Jongmin Jung, and Dasaem Jeon
International Society for Music Information Retrieval Conference (ISMIR), 2024
paper
demo
code
reviews
Music Generation
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, and Julian McAuley
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
paper
demo
video
slides
reviews
Oral presentation Audio Synthesis Multimodal Learning
Multitrack Music Transformer
Hao-Wen Dong, Ke Chen, Shlomo Dubnov, Julian McAuley, and Taylor Berg-Kirkpatrick
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
paper
demo
video
slides
code
reviews
Oral presentation Music Generation
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Hao-Wen Dong, Naoya Takahashi, Yuki Mitsufuji, Julian McAuley, and Taylor Berg-Kirkpatrick
International Conference on Learning Representations (ICLR), 2023
paper
demo
video
slides
poster
code
reviews
Sound Separation Multimodal Learning
Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments
Ke Chen, Hao-Wen Dong, Yi Luo, Julian McAuley, Taylor Berg-Kirkpatrick, Miller Puckette, and Shlomo Dubnov
International Society for Music Information Retrieval Conference (ISMIR), 2022
paper
demo
code
reviews
Sound Separation
Deep Performer: Score-to-Audio Music Performance Synthesis
Hao-Wen Dong, Cong Zhou, Taylor Berg-Kirkpatrick, and Julian McAuley
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
paper
demo
video
slides
poster
reviews
Audio Synthesis Music Performance Rendering
Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music
Hao-Wen Dong, Chris Donahue, Taylor Berg-Kirkpatrick and Julian McAuley
International Society for Music Information Retrieval Conference (ISMIR), 2021
paper
demo
video
slides
code
reviews
Music Compositional Tools
An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition
Sachinda Edirisooriya, Hao-Wen Dong, Julian McAuley and Taylor Berg-Kirkpatrick
International Society for Music Information Retrieval Conference (ISMIR), 2021
paper
code
reviews
Optical Music Recognition
MusPy: A Toolkit for Symbolic Music Generation
Hao-Wen Dong, Ke Chen, Julian McAuley, and Taylor Berg-Kirkpatrick
International Society for Music Information Retrieval Conference (ISMIR), 2020
paper
video
slides
poster
code
documentation
reviews
Infrastructure
Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation
Hao-Wen Dong and Yi-Hsuan Yang
International Society for Music Information Retrieval Conference (ISMIR), 2018
paper
demo
video
slides
poster
code
reviews
Music Generation
The Name-Free Gap: Policy-Aware Stylistic Control in Music Generation
Ashwin Nagarajan and Hao-Wen Dong
NeurIPS Workshop on Artificial Intelligence for Music (AI4Music), 2025
paper
demo
code
Music Generation
Video-to-Music Generation for Film Production: A Dataset and Framework
Haven Kim, Leduo Chen, Bill Wang, Hao-Wen Dong, and Julian McAuley
NeurIPS Workshop on Artificial Intelligence for Music (AI4Music), 2025
Multimodal Learning
Curating an A Cappella Dataset for Source Separation
Ting-Yu Pan*, Kexin Phyllis Ju*, and Hao-Wen Dong (*equal contribution)
ISMIR Late-Breaking Demos, 2025
paper
demo
poster
Sound Separation
On Output Activation Functions for Adversarial Losses: A Theoretical Analysis via Variational Divergence Minimization and An Empirical Study on MNIST Classification
Hao-Wen Dong and Yi-Hsuan Yang
arXiv preprint arXiv:1901.08753, 2019
paper
demo
code
Fundamental Machine Learning
Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation
Hao-Wen Dong and Yi-Hsuan Yang
arXiv preprint arXiv:1810.04714, 2018
paper
demo
slides
code
Fundamental Machine Learning