TY - GEN
T1 - openSMILE - The Munich versatile and fast open-source audio feature extractor
AU - Eyben, Florian
AU - Wöllmer, Martin
AU - Schuller, Björn
PY - 2010
Y1 - 2010
AB - We introduce the openSMILE feature extraction toolkit, which unites feature extraction algorithms from the speech processing and the Music Information Retrieval communities. Audio low-level descriptors such as CHROMA and CENS features, loudness, Mel-frequency cepstral coefficients, perceptual linear predictive cepstral coefficients, linear predictive coefficients, line spectral frequencies, fundamental frequency, and formant frequencies are supported. Delta regression and various statistical functionals can be applied to the low-level descriptors. openSMILE is implemented in C++ with no third-party dependencies for the core functionality. It is fast, runs on Unix and Windows platforms, and has a modular, component based architecture which makes extensions via plug-ins easy. It supports on-line incremental processing for all implemented features as well as off-line and batch processing. Numeric compatibility with future versions is ensured by means of unit tests. openSMILE can be downloaded from http://opensmile.sourceforge.net/.
KW - audio feature extraction
KW - emotion
KW - music
KW - signal processing
KW - speech
KW - statistical functionals
UR - https://www.scopus.com/pages/publications/78650977476
U2 - 10.1145/1873951.1874246
DO - 10.1145/1873951.1874246
M3 - Conference contribution
AN - SCOPUS:78650977476
SN - 9781605589336
T3 - MM'10 - Proceedings of the ACM Multimedia 2010 International Conference
SP - 1459
EP - 1462
BT - MM'10 - Proceedings of the ACM Multimedia 2010 International Conference
T2 - 18th ACM International Conference on Multimedia, ACM Multimedia 2010, MM'10
Y2 - 25 October 2010 through 29 October 2010
ER -