ISMIR 2021
Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music
Hao-Wen Dong1
Chris Donahue2
Taylor Berg-Kirkpatrick1
Julian McAuley1
1 University of California San Diego
2 Stanford University
paper demo video slides code reviews
All samples are synthesized using FluidSynth with the MuseScore General soundfont.
Mixture (input) | Predicted instrumentation (output) |
---|---|
(guitar) | (piano, guitar, bass, strings, brass) |
Colors: piano, guitar, bass, strings, brass.
Mixture (input) | ![]() |
|
LSTM | ![]() |
|
BiLSTM | ![]() |
|
Ground truth | ![]() |
Colors: first violin, second violin, viola, cello.
Mixture (input) | ![]() |
|
LSTM | ![]() |
|
BiLSTM | ![]() |
|
Ground truth | ![]() |
Original music score:
Colors: soprano, alto, tenor, bass.
Mixture (input) | ![]() |
|
LSTM | ![]() |
|
BiLSTM | ![]() |
|
Ground truth | ![]() |
Original music score:
Colors: pulse wave I, pulse wave II, triangle wave.
Mixture (input) | ![]() |
|
LSTM | ![]() |
|
BiLSTM | ![]() |
|
Ground truth | ![]() |
Colors: piano, guitar, bass, strings, brass.
Mixture (input) | ![]() |
|
LSTM | ![]() |
|
BiLSTM | ![]() |
|
Ground truth | ![]() |
Colors: piano, guitar, bass, strings, brass.
Mixture (input) | ![]() |
|
LSTM | ![]() |
|
BiLSTM | ![]() |
|
Ground truth | ![]() |
Hao-Wen Dong, Chris Donahue, Taylor Berg-Kirkpatrick, and Julian McAuley, “Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music,” Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2021.
@inproceedings{dong2021arranger,
author = {Hao-Wen Dong and Chris Donahue and Taylor Berg-Kirkpatrick and Julian McAuley},
title = {Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music},
booktitle = {Proceedings of the International Society for Music Information Retrieval Conference (ISMIR)},
year = 2021,
}