Hao-Wen (Herman) Dong
董皓文
Assistant Professor
Performing Arts Technology
School of Music, Theatre & Dance
University of Michigan
ude.hcimu@gnodwh
Stearns 131
Home
Research
Publications
Talks
Teaching
Resources
Interests
CV
Google Scholar
LinkedIn
Twitter
GitHub
Dark mode
Light mode
Hosted on GitHub Pages. Powered by
Jekyll
. Theme adapted from
minimal
by
orderedlist.
Best Papers from Selected Conferences/Journals
Content
ISMIR
Audio Mostly
NIME
JAES
DAFx
ICAD
ICASSP
WASPAA
Please connect to U-M VPN to download the papers!
ISMIR
(2024)
Six Dragons Fly Again: Reviving 15th-Century Korean Court Music with Transformers and Novel Encoding
by Danbinaerin Han, Mark Gotham, Dongmin Kim, Hannah Park, Sihun Lee, and Dasaem Jeong
(2024)
MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
by Benno Weck, Ilaria Manco, Emmanouil Benetos, Elio Quinton, George Fazekas, and Dmitry Bogdanov
(2024)
ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization
by Christian J. Steinmetz, Shubhr Singh, Marco Comunità, Ilias Ibnyahya, Shanxin Yuan, Emmanouil Benetos, and Joshua D. Reiss
(2023)
PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective
by Alain Riou, Lattner, Gaëtan Hadjeres, and Geoffroy Peeters
(2023)
CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval
by Shangda Wu, Dingyao Yu, Xu Tan, and Maosong Sun
(2022)
Performance MIDI-to-score conversion by neural beat tracking
by Lele Liu, Qiuqiang Kong, Veronica Morfi, and Emmanouil Benetos
(2022)
Traces of Globalization in Online Music Consumption Patterns and Results of Recommendation Algorithms
by Oleg Lesota, Emilia Parada-Cabaleiro, Elisabeth Lex, Navid Rekabsaz, Stefan Brandl, and Markus Schedl
(2021)
Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition
by Hugo Flores Garcia, Aldo Aguilar, Ethan Manilow, and Bryan Pardo
(2021)
Emotion Embedding Spaces for Matching Music to Stories
by Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, and Xavier Serra
(2020)
BebopNet: Deep Neural Models for Personalized Jazz Improvisations
by Shunit Haviv Hakimi, Nadav Bhonker, and Ran El-Yaniv
(2020)
Perceptual Vs. Automated Judgements of Music Copyright Infringement
by Yuchen Yuan, Sho Oishi, Charles Cronin, Daniel Müllensiefen, Quentin Atkinson, Shinya Fujii, and Patrick E. Savage
(2020)
Sesquialtera in the Colombian Bambuco: Perception and Estimation of Beat and Meter
by Estefania Cano, Fernando Mora Ángel, Gustavo A. López Gil, José Ricardo Zapata, Antonio Escamilla, Juan F. Alzate, and Moisés Betancur
(2020)
Mode Classification and Natural Units in Plainchant
by Bas Cornelissen, Willem Zuidema, and John Ashley Burgoyne
(2020)
Deconstruct, Analyse, Reconstruct: How to Improve Tempo, Beat, and Downbeat Estimation
by Sebastian Böck and Matthew Davies
(2020)
Essentia.js: a JavaScript Library for Music and Audio Analysis on the Web
by Albin Correya, Dmitry Bogdanov, Luis Joglar-Ongay, and Xavier Serra
Audio Mostly
(2024)
Sonic Shuttle Run: Leveraging Sound Design to Improve Affective Response and Performance in Maximal Exercise Tests
by Daniel Hug, Sascha Ketelhut
(2023)
An Interactive Modular System for Electrophysiological DMIs
by Francesco Di Maggio, Atau Tanaka, David Fierro and Stephen Whitmarsh
(2023)
A Plugin for Neural Audio Synthesis of Impact Sound Effects
by Zih Syuan Yang and Jason Hockman
(2022)
Matching auditory and visual room size, distance, and source orientation in virtual reality
by Matthias Frank and Djordje Perinovic
(2022)
Manipulating Foley Footsteps and Character Realism to Influence Audience Perceptions of a 3D Animated Walk Cycle
by Stuart Cunningham and Iain McGregor
(2021)
Mind the Steps: Towards Auditory Feedback in Tele-Rehabilitation Based on Automated Gait Classification
by Michael Iber, Bernhard Dumphart, Victor-Adriel de Jesus Oliveira, Stefan Ferstl, Joschua M. Reis, Djordje Slijepčević, Mario Heller, Anna-Maria Raberger, and Brian Horsak
(2020)
Standstill to the ‘beat’: Differences in involuntary movement responses to simple and complex rhythms
by Agata Zelechowska, Victor Gonzalez Sanchez, and Alexander Refsum Jensenius
(2020)
Surround Sound Spreads Visual Attentiin and Increases Cognitive Effort in Immersive Media Productions
by Catarina Mendonca and Victoria Korshunova
NIME
(2023)
The BioSynth—an affective biofeedback device grounded in feminist thought
by Erin Gee
(2022)
Bandoneon 2.0: an interdisciplinary project for research and development of electronic bandoneons in Argentina
by Juan Ramos, Esteban Ramón Calcagno, Ramiro oscar Vergara, Pablo Riera, and Joaquín Rizza
(2021)
Spire Muse: A Virtual Musical Partner for Creative Brainstorming
by Notto J. W. Thelle and Philippe Pasquier
(2020)
Nuanced and Interrelated Mediations and Exigencies (NIME): Addressing the Prevailing Political and Epistemological Crises
by Lauren Hayes and Adnan Marquez-Borbon
JAES
(2023)
Speech Intelligibility and Quality Evaluation of Automotive Microphones Using Different Test Metrics and Their Correlation
by Yu Du
(2022)
Web-Based Networked Music Performances via WebRTC: A Low-Latency PCM Audio Solution
by Matteo Sacchetto, Paolo Gastaldi, Chris Chafe, Cristina Rottondi, and Antonio Servetti
(2021)
Do We Really Want to Keep the Gate Threshold That High?
by Grace Brooks, Amandine Pras, Athena Elafros, and Monica Lockett
(2020)
Assessing the Impact of Head-Related Transfer Function Individualization on Task Performance: Case of a Virtual Reality Shooter Game
by David Poirier-Quinot and Brian F.G. Katz
DAFx
(2024)
Wave Digital Modeling of Circuits with Multiple One-Port Nonlinearities Based on Lipschitz-Bounded Neural Networks
by Oliviero Massi, Edoardo Manino, and Alberto Bernardini
(2024)
A Real-Time Approach for Estimating Pulse Tracking Parameters for Beat-Synchronous Audio Effects
by Peter Meier, Simon Schwär, and Meinard Müller
(2024)
CONMOD: Controllable Neural Frame-Based Modulation Effects
by Gyubin Lee, Hounsu Kim, Junwon Lee, and Juhan Nam
(2023)
Differentiable Feedback Delay Network for Colorless Reverberation
by Gloria Dal Santo, Karolina Prawda, Sebastian Jiro Schlecht, and Vesa Välimäki
(2022)
Differentiable Piano Model for Midi-to-Audio Performance Synthesis
by Lenny Renault, Rémi Mignot, and Axel Roebel
(2022)
Differentiable Time–frequency Scattering on GPU
by John Muradeli, Cyrus Vahidi, Changhong Wang, Han Han, Vincent Lostanlen, Mathieu Lagrange, and eorge Fazekas
(2022)
Physical Modeling Using Recurrent Neural Networks with Fast Convolutional Layers
by Julian D. Parker, Sebastian J. Schlecht, Rudolf Rabenstein, and Maximilian Schäfer
Dynamic Grids for Finite-Difference Schemes in Musical Instrument Simulations
by Silvin Willemsen, Stefan Bilbao, Michele Ducceschi, and Stefania Serafin
A Physical Model of the Trombone Using Dynamic Grids for Finite-Difference Schemes
by Silvin Willemsen, Stefan Bilbao, Michele Ducceschi, and Stefania Serafin
(2021)
The Role of Modal Excitation in Colorless Reverberation
by Janis Heldmann and Sebastian J. Schlecht
ICAD
(2024)
Spin-Wave Voices: Sonification of Nanoscale Spin Waves as an Engagement and Possible Research Tool
by Santa Pile, Oleg Lesota, Silvan David Peter, Christina Humer, and Martin Gasser
(2024)
Adapting Audio Mixing Principles and Tools to Parameter Mapping Sonification Design
by Prithvi Ravi Kantan, Sofia Dahl, and Erika G. Spaich
(2022)
AltAR/table: A Platform for Plausible Auditory Augmentation
by Marian Weger, Thomas Hermann, and Robert Höldrich
(2021)
Strategies and tools for the sonification of prosodic data: A composer’s perspective
by Fabio Cifariello Ciardi
(2021)
OpenSpace sonification: complementing visualization of the solar system with sound
by Elias Elmquist, Malin Ejdbo, Alexander Bock, and Niklas Ronnberg
ICASSP
(2024)
Composite Federated Learning with Heterogeneous Data
by Jiaojiao Zhang, Jiang Hu, and Mikael Johansson
(2024)
Touring Samples with Pushforward Maps
by Vivien Cabanners and Charles Arnal
(2024)
STAR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models
by Kangwook Jang, Sungnyun Kim, and Hoirin Kim
(2024)
DGLP: Incorporating Orientation Information for Enhanced Link Prediction in Directed Graphs
by Yusen Zhang, Yusong Tan, Songlei Jian, Qingbo Wu, and Kenli Li
(2024)
Significant ASR Error Detection for Conversational Voice Assistants
by John Harvill, Rinat Khaziev, Scarlett Li, Randy Cogill, Lidan Wang, Gopinath Chennupati, and Hari Thadakamalla
(2024)
Adapting Frechet Audio Distance for Generative Music Evaluation
by Azalea Gui, Hannes Gamper, Sebastian Braun, and Dimitra Emmanouilidou
(2024)
Robust Symbol-Level Precoding via A Symbol-Perturbed Zero-Forcing Structure
by Wai-You Keung, Yatao Liu, and Wing-Kin Ma
(2023)
Audio signal enhancement with learning from positive and unlabelled data
by Nobutaka Ito and Masashi Sugiyama
(2023)
Contrastive learning-based audio to lyrics alignment for multiple languages
by Simon Durand, Daniel Stoller, and Sebastian Ewert
(2023)
Perspective projection-based 3D CT reconstruction from biplanar X-rays
by Daeun Kyung, Kyungmin Jo, Jaegul Choo, Joonseok Lee, and Edward Choi
(2023)
Large covariance matrix estimation with oracle statistical rate
by Quan Wei and Ziping Zhao
(2023)
Hyperbolic audio source separation
by Darius Petermann, Gordon Wichern, Aswin Shanmugam Subramanian, and Jonathan LeRoux
(2023)
High-dimensional confidence regions in sparse MRI
by Frederik Hoppe, Felix Krahmer, Claudio Mayrink Verdun, Marion Menzel, and Holger Rauhut
(2023)
Solving audio inverse problems with a diffusion model
by Eloi Moliner, Jaakko Lehtinen, and Vesa Valimaki
(2022)
Generalized Sliced Probability Metrics
by Soheil Kolouri, Kimia Nadjahi, Shahin Shahrampour, and Umut Şimşekli
(2022)
VarArray: Array-Geometry-Agnostic Continuous Speech Separation
by Takuya Yoshioka, Xiaofei Wang, Dongmei Wang, Min Tang, Zirun Zhu, and Zhuo Chen
(2022)
Camera Calibration Through Camera Projection Loss
by Talha Hanif Butt and Murtaza Taj
(2022)
Sketched RT3D: How to Reconstruct Billions of Photons Per Second
by Julián Tachella, Michael P. Sheehan, and Mike E. Davies
(2022)
Efficiently and Globally Solving Joint Beamforming and Compression Problem in the Cooperative Cellular Network Via Lagrangian Duality
by Xilai Fan, Ya-Feng Liu, and Liang Liu
WASPAA
(2023)
General Purpose Audio Effect Removal
by Matthew Rice, Christian J. Steinmetz, George Fazekas, and Joshua D. Reiss
(2023)
Differentiable Representation of Warping Based on Lie Group Theory
by Atsushi Miyashita and Tomoki Toda
(2023)
Leveraging Synthetic Data for Improving Chamber Ensemble Separation
by Saurjya Sarkar, Louise Thorpe, Emmanouil Benetos, and Mark Sandler
(2023)
All-In-One Metrical And Functional Structure Analysis with Neighborhood Attentions on Demixed Audio
by Taejun Kim and Juhan Nam
(2021)
Point Cloud Audio Processing
by Krishna Subramani and Paris Smaragdis
(2021)
Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech
by Christian J. Steinmetz, Vamsi Krishna Ithapu, and Paul Calamia