Best Papers from Selected Conferences/Journals

Content

ISMIR
Audio Mostly
NIME
JAES
DAFx
ICAD
ICASSP
WASPAA

Please connect to U-M VPN to download the papers!

ISMIR

(2024) Six Dragons Fly Again: Reviving 15th-Century Korean Court Music with Transformers and Novel Encoding by Danbinaerin Han, Mark Gotham, Dongmin Kim, Hannah Park, Sihun Lee, and Dasaem Jeong
(2024) MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models by Benno Weck, Ilaria Manco, Emmanouil Benetos, Elio Quinton, George Fazekas, and Dmitry Bogdanov
(2024) ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization by Christian J. Steinmetz, Shubhr Singh, Marco Comunità, Ilias Ibnyahya, Shanxin Yuan, Emmanouil Benetos, and Joshua D. Reiss
(2023) PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective by Alain Riou, Lattner, Gaëtan Hadjeres, and Geoffroy Peeters
(2023) CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval by Shangda Wu, Dingyao Yu, Xu Tan, and Maosong Sun
(2022) Performance MIDI-to-score conversion by neural beat tracking by Lele Liu, Qiuqiang Kong, Veronica Morfi, and Emmanouil Benetos
(2022) Traces of Globalization in Online Music Consumption Patterns and Results of Recommendation Algorithms by Oleg Lesota, Emilia Parada-Cabaleiro, Elisabeth Lex, Navid Rekabsaz, Stefan Brandl, and Markus Schedl
(2021) Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition by Hugo Flores Garcia, Aldo Aguilar, Ethan Manilow, and Bryan Pardo
(2021) Emotion Embedding Spaces for Matching Music to Stories by Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, and Xavier Serra
(2020) BebopNet: Deep Neural Models for Personalized Jazz Improvisations by Shunit Haviv Hakimi, Nadav Bhonker, and Ran El-Yaniv
(2020) Perceptual Vs. Automated Judgements of Music Copyright Infringement by Yuchen Yuan, Sho Oishi, Charles Cronin, Daniel Müllensiefen, Quentin Atkinson, Shinya Fujii, and Patrick E. Savage
(2020) Sesquialtera in the Colombian Bambuco: Perception and Estimation of Beat and Meter by Estefania Cano, Fernando Mora Ángel, Gustavo A. López Gil, José Ricardo Zapata, Antonio Escamilla, Juan F. Alzate, and Moisés Betancur
(2020) Mode Classification and Natural Units in Plainchant by Bas Cornelissen, Willem Zuidema, and John Ashley Burgoyne
(2020) Deconstruct, Analyse, Reconstruct: How to Improve Tempo, Beat, and Downbeat Estimation by Sebastian Böck and Matthew Davies
(2020) Essentia.js: a JavaScript Library for Music and Audio Analysis on the Web by Albin Correya, Dmitry Bogdanov, Luis Joglar-Ongay, and Xavier Serra

Audio Mostly

(2024) Sonic Shuttle Run: Leveraging Sound Design to Improve Affective Response and Performance in Maximal Exercise Tests by Daniel Hug, Sascha Ketelhut
(2023) An Interactive Modular System for Electrophysiological DMIs by Francesco Di Maggio, Atau Tanaka, David Fierro and Stephen Whitmarsh
(2023) A Plugin for Neural Audio Synthesis of Impact Sound Effects by Zih Syuan Yang and Jason Hockman
(2022) Matching auditory and visual room size, distance, and source orientation in virtual reality by Matthias Frank and Djordje Perinovic
(2022) Manipulating Foley Footsteps and Character Realism to Influence Audience Perceptions of a 3D Animated Walk Cycle by Stuart Cunningham and Iain McGregor
(2021) Mind the Steps: Towards Auditory Feedback in Tele-Rehabilitation Based on Automated Gait Classification by Michael Iber, Bernhard Dumphart, Victor-Adriel de Jesus Oliveira, Stefan Ferstl, Joschua M. Reis, Djordje Slijepčević, Mario Heller, Anna-Maria Raberger, and Brian Horsak
(2020) Standstill to the ‘beat’: Differences in involuntary movement responses to simple and complex rhythms by Agata Zelechowska, Victor Gonzalez Sanchez, and Alexander Refsum Jensenius
(2020) Surround Sound Spreads Visual Attentiin and Increases Cognitive Effort in Immersive Media Productions by Catarina Mendonca and Victoria Korshunova

NIME

(2023) The BioSynth—an affective biofeedback device grounded in feminist thought by Erin Gee
(2022) Bandoneon 2.0: an interdisciplinary project for research and development of electronic bandoneons in Argentina by Juan Ramos, Esteban Ramón Calcagno, Ramiro oscar Vergara, Pablo Riera, and Joaquín Rizza
(2021) Spire Muse: A Virtual Musical Partner for Creative Brainstorming by Notto J. W. Thelle and Philippe Pasquier
(2020) Nuanced and Interrelated Mediations and Exigencies (NIME): Addressing the Prevailing Political and Epistemological Crises by Lauren Hayes and Adnan Marquez-Borbon

JAES

(2023) Speech Intelligibility and Quality Evaluation of Automotive Microphones Using Different Test Metrics and Their Correlation by Yu Du
(2022) Web-Based Networked Music Performances via WebRTC: A Low-Latency PCM Audio Solution by Matteo Sacchetto, Paolo Gastaldi, Chris Chafe, Cristina Rottondi, and Antonio Servetti
(2021) Do We Really Want to Keep the Gate Threshold That High? by Grace Brooks, Amandine Pras, Athena Elafros, and Monica Lockett
(2020) Assessing the Impact of Head-Related Transfer Function Individualization on Task Performance: Case of a Virtual Reality Shooter Game by David Poirier-Quinot and Brian F.G. Katz

DAFx

(2024) Wave Digital Modeling of Circuits with Multiple One-Port Nonlinearities Based on Lipschitz-Bounded Neural Networks by Oliviero Massi, Edoardo Manino, and Alberto Bernardini
(2024) A Real-Time Approach for Estimating Pulse Tracking Parameters for Beat-Synchronous Audio Effects by Peter Meier, Simon Schwär, and Meinard Müller
(2024) CONMOD: Controllable Neural Frame-Based Modulation Effects by Gyubin Lee, Hounsu Kim, Junwon Lee, and Juhan Nam
(2023) Differentiable Feedback Delay Network for Colorless Reverberation by Gloria Dal Santo, Karolina Prawda, Sebastian Jiro Schlecht, and Vesa Välimäki
(2022) Differentiable Piano Model for Midi-to-Audio Performance Synthesis by Lenny Renault, Rémi Mignot, and Axel Roebel
(2022) Differentiable Time–frequency Scattering on GPU by John Muradeli, Cyrus Vahidi, Changhong Wang, Han Han, Vincent Lostanlen, Mathieu Lagrange, and eorge Fazekas
(2022) Physical Modeling Using Recurrent Neural Networks with Fast Convolutional Layers by Julian D. Parker, Sebastian J. Schlecht, Rudolf Rabenstein, and Maximilian Schäfer
(2021) Dynamic Grids for Finite-Difference Schemes in Musical Instrument Simulations by Silvin Willemsen, Stefan Bilbao, Michele Ducceschi, and Stefania Serafin
(2021) A Physical Model of the Trombone Using Dynamic Grids for Finite-Difference Schemes by Silvin Willemsen, Stefan Bilbao, Michele Ducceschi, and Stefania Serafin
(2021) The Role of Modal Excitation in Colorless Reverberation by Janis Heldmann and Sebastian J. Schlecht

ICAD

(2024) Spin-Wave Voices: Sonification of Nanoscale Spin Waves as an Engagement and Possible Research Tool by Santa Pile, Oleg Lesota, Silvan David Peter, Christina Humer, and Martin Gasser
(2024) Adapting Audio Mixing Principles and Tools to Parameter Mapping Sonification Design by Prithvi Ravi Kantan, Sofia Dahl, and Erika G. Spaich
(2022) AltAR/table: A Platform for Plausible Auditory Augmentation by Marian Weger, Thomas Hermann, and Robert Höldrich
(2021) Strategies and tools for the sonification of prosodic data: A composer’s perspective by Fabio Cifariello Ciardi
(2021) OpenSpace sonification: complementing visualization of the solar system with sound by Elias Elmquist, Malin Ejdbo, Alexander Bock, and Niklas Ronnberg

ICASSP

(2024) Composite Federated Learning with Heterogeneous Data by Jiaojiao Zhang, Jiang Hu, and Mikael Johansson
(2024) Touring Samples with Pushforward Maps by Vivien Cabanners and Charles Arnal
(2024) STAR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models by Kangwook Jang, Sungnyun Kim, and Hoirin Kim
(2024) DGLP: Incorporating Orientation Information for Enhanced Link Prediction in Directed Graphs by Yusen Zhang, Yusong Tan, Songlei Jian, Qingbo Wu, and Kenli Li
(2024) Significant ASR Error Detection for Conversational Voice Assistants by John Harvill, Rinat Khaziev, Scarlett Li, Randy Cogill, Lidan Wang, Gopinath Chennupati, and Hari Thadakamalla
(2024) Adapting Frechet Audio Distance for Generative Music Evaluation by Azalea Gui, Hannes Gamper, Sebastian Braun, and Dimitra Emmanouilidou
(2024) Robust Symbol-Level Precoding via A Symbol-Perturbed Zero-Forcing Structure by Wai-You Keung, Yatao Liu, and Wing-Kin Ma
(2023) Audio signal enhancement with learning from positive and unlabelled data by Nobutaka Ito and Masashi Sugiyama
(2023) Contrastive learning-based audio to lyrics alignment for multiple languages by Simon Durand, Daniel Stoller, and Sebastian Ewert
(2023) Perspective projection-based 3D CT reconstruction from biplanar X-rays by Daeun Kyung, Kyungmin Jo, Jaegul Choo, Joonseok Lee, and Edward Choi
(2023) Large covariance matrix estimation with oracle statistical rate by Quan Wei and Ziping Zhao
(2023) Hyperbolic audio source separation by Darius Petermann, Gordon Wichern, Aswin Shanmugam Subramanian, and Jonathan LeRoux
(2023) High-dimensional confidence regions in sparse MRI by Frederik Hoppe, Felix Krahmer, Claudio Mayrink Verdun, Marion Menzel, and Holger Rauhut
(2023) Solving audio inverse problems with a diffusion model by Eloi Moliner, Jaakko Lehtinen, and Vesa Valimaki
(2022) Generalized Sliced Probability Metrics by Soheil Kolouri, Kimia Nadjahi, Shahin Shahrampour, and Umut Şimşekli
(2022) VarArray: Array-Geometry-Agnostic Continuous Speech Separation by Takuya Yoshioka, Xiaofei Wang, Dongmei Wang, Min Tang, Zirun Zhu, and Zhuo Chen
(2022) Camera Calibration Through Camera Projection Loss by Talha Hanif Butt and Murtaza Taj
(2022) Sketched RT3D: How to Reconstruct Billions of Photons Per Second by Julián Tachella, Michael P. Sheehan, and Mike E. Davies
(2022) Efficiently and Globally Solving Joint Beamforming and Compression Problem in the Cooperative Cellular Network Via Lagrangian Duality by Xilai Fan, Ya-Feng Liu, and Liang Liu

WASPAA

(2023) General Purpose Audio Effect Removal by Matthew Rice, Christian J. Steinmetz, George Fazekas, and Joshua D. Reiss
(2023) Differentiable Representation of Warping Based on Lie Group Theory by Atsushi Miyashita and Tomoki Toda
(2023) Leveraging Synthetic Data for Improving Chamber Ensemble Separation by Saurjya Sarkar, Louise Thorpe, Emmanouil Benetos, and Mark Sandler
(2023) All-In-One Metrical And Functional Structure Analysis with Neighborhood Attentions on Demixed Audio by Taejun Kim and Juhan Nam
(2021) Point Cloud Audio Processing by Krishna Subramani and Paris Smaragdis
(2021) Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech by Christian J. Steinmetz, Vamsi Krishna Ithapu, and Paul Calamia

Hosted on GitHub Pages. Powered by Jekyll. Theme adapted from minimal by orderedlist.