TY - JOUR
T1 - Music Theoretic and Perception-based Features for Audio Key Determination
AU - Schuller, Björn
AU - Gollan, Benedikt
PY - 2012/6
Y1 - 2012/6
N2 - The musical key of a piece is the fundamental knowledge for many Music Information Retrieval tasks as automatic transcription, chord detection or automatic play list generation. To this end, novel features are proposed and evaluated on the basis of musical knowledge, in this article-a total of 13 feature groups based on chromatic representation include scales, chords, major and minor Probe Tone Ratings, and further derived variations. We present extensive results on a database containing 35 h of originally recorded music covering diverse genres as popular, classical, and Jazz music. In the pre-processing, adjustment of reference pitch classes, dB(A)-weighting, band-pass-filter type selection, frequency selection, and gating time are discussed in detail. In addition to traditional correlation-based approaches we employ a data-driven model using Support Vector Machines. Overall, the proposed features enhance correlation results and their combination boosts data-driven audio key determination accuracy. The latter approach based on learnt models significantly outperforms correlation-based determination by 5.0% and 6.7% absolute for the determination of 12 and 24 major and minor keys (clustered or separately), respectively. Maximum accuracy reaches 77.3% and 62.1% for these 12 or 24 keys over the diverse recordings.
AB - The musical key of a piece is the fundamental knowledge for many Music Information Retrieval tasks as automatic transcription, chord detection or automatic play list generation. To this end, novel features are proposed and evaluated on the basis of musical knowledge, in this article-a total of 13 feature groups based on chromatic representation include scales, chords, major and minor Probe Tone Ratings, and further derived variations. We present extensive results on a database containing 35 h of originally recorded music covering diverse genres as popular, classical, and Jazz music. In the pre-processing, adjustment of reference pitch classes, dB(A)-weighting, band-pass-filter type selection, frequency selection, and gating time are discussed in detail. In addition to traditional correlation-based approaches we employ a data-driven model using Support Vector Machines. Overall, the proposed features enhance correlation results and their combination boosts data-driven audio key determination accuracy. The latter approach based on learnt models significantly outperforms correlation-based determination by 5.0% and 6.7% absolute for the determination of 12 and 24 major and minor keys (clustered or separately), respectively. Maximum accuracy reaches 77.3% and 62.1% for these 12 or 24 keys over the diverse recordings.
UR - http://www.scopus.com/inward/record.url?scp=84863519780&partnerID=8YFLogxK
U2 - 10.1080/09298215.2011.618543
DO - 10.1080/09298215.2011.618543
M3 - Article
AN - SCOPUS:84863519780
SN - 0929-8215
VL - 41
SP - 175
EP - 193
JO - Journal of New Music Research
JF - Journal of New Music Research
IS - 2
ER -