Skip to main navigation Skip to search Skip to main content

OpenSMILE - The Munich versatile and fast open-source audio feature extractor

  • Technical University of Munich

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2902 Scopus citations

Abstract

We introduce the openSMILE feature extraction toolkit, which unites feature extraction algorithms from the speech processing and the Music Information Retrieval communities. Audio low-level descriptors such as CHROMA and CENS features, loudness, Mel-frequency cepstral coefficients, perceptual linear predictive cepstral coefficients, linear predictive coefficients, line spectral frequencies, fundamental frequency, and formant frequencies are supported. Delta regression and various statistical functionals can be applied to the low-level descriptors. openSMILE is implemented in C++ with no third-party dependencies for the core functionality. It is fast, runs on Unix and Windows platforms, and has a modular, component based architecture which makes extensions via plug-ins easy. It supports on-line incremental processing for all implemented features as well as off-line and batch processing. Numeric compatibility with future versions is ensured by means of unit tests. openSMILE can be downloaded from http://opensmile.sourceforge.net/.

Original languageEnglish
Title of host publicationMM'10 - Proceedings of the ACM Multimedia 2010 International Conference
Pages1459-1462
Number of pages4
DOIs
StatePublished - 2010
Event18th ACM International Conference on Multimedia ACM Multimedia 2010, MM'10 - Firenze, Italy
Duration: 25 Oct 201029 Oct 2010

Publication series

NameMM'10 - Proceedings of the ACM Multimedia 2010 International Conference

Conference

Conference18th ACM International Conference on Multimedia ACM Multimedia 2010, MM'10
Country/TerritoryItaly
CityFirenze
Period25/10/1029/10/10

Keywords

  • audio feature extraction
  • emotion
  • music
  • signal processing
  • speech
  • statistical functionals

Fingerprint

Dive into the research topics of 'OpenSMILE - The Munich versatile and fast open-source audio feature extractor'. Together they form a unique fingerprint.

Cite this