TY - GEN
T1 - A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization
AU - Weninger, Felix
AU - Kirst, Christian
AU - Schuller, Björn
AU - Bungartz, Hans-Joachim
PY - 2013/10/18
Y1 - 2013/10/18
N2 - We introduce a novel method for the transcription of polyphonic piano music by discriminative training of support vector machines (SVMs). As features, we use pitch activations computed by supervised non-negative matrix factorization from low-level spectral features. Different approaches to low-level feature extraction, NMF dictionary learning and activation feature extraction are analyzed in a large-scale evaluation on eight hours of piano music including synthesized and real recordings. We conclude that the proposed method delivers state-of-the-art results and clearly outperforms SVMs using simple spectral features.
KW - Transcription
KW - music information retrieval
KW - non-negative matrix factorization
KW - sparse coding
UR - http://www.scopus.com/inward/record.url?scp=84890485042&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2013.6637598
DO - 10.1109/ICASSP.2013.6637598
M3 - Conference contribution
AN - SCOPUS:84890485042
SN - 9781479903566
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 6
EP - 10
BT - 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
T2 - 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
Y2 - 26 May 2013 through 31 May 2013
ER -