
openXBOW – Introducing the Passau open-source crossmodal bag-of-words toolkit

Research output: Contribution to journal › Article › peer-review

131 Scopus citations

Abstract

We introduce openXBOW, an open-source toolkit for the generation of bag-of-words (BoW) representations from multimodal input. Word histograms were first used as features in document classification, but the BoW principle can easily be adapted to other modalities, e.g., acoustic or visual descriptors, by introducing a prior vector quantisation step. The openXBOW toolkit supports arbitrary numeric input features as well as text input, and concatenates the computed sub-bags into a final bag. It provides a variety of extensions and options. To our knowledge, openXBOW is the first publicly available toolkit for the generation of crossmodal bags-of-words. The capabilities of the tool have been exemplified in different scenarios: sentiment analysis in tweets, classification of snore sounds, and time-dependent emotion recognition based on acoustic, linguistic, and visual information, where improved results over other feature representations were observed.
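
The crossmodal BoW idea described in the abstract can be illustrated with a minimal Python sketch. This is not openXBOW's implementation or API. The function names, the hand-picked codebook, the fixed vocabulary, and the whitespace tokenisation are all assumptions made for illustration. Numeric frames are vector-quantised against a codebook and counted, words are counted against a vocabulary, and the two sub-bags are concatenated.

```python
from collections import Counter


def quantise(frame, codebook):
    """Return the index of the codeword closest to `frame` (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((f - c) ** 2 for f, c in zip(frame, codebook[i])),
    )


def numeric_bag(frames, codebook):
    """Histogram of codeword assignments over all frames (vector quantisation step)."""
    counts = [0] * len(codebook)
    for frame in frames:
        counts[quantise(frame, codebook)] += 1
    return counts


def text_bag(text, vocabulary):
    """Histogram of vocabulary words in lower-cased, whitespace-tokenised text."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocabulary]


def crossmodal_bag(frames, codebook, text, vocabulary):
    """Concatenate the numeric sub-bag and the text sub-bag into one feature vector."""
    return numeric_bag(frames, codebook) + text_bag(text, vocabulary)


# Toy data (hypothetical): two 2-D codewords, three frames, a two-word vocabulary.
codebook = [[0.0, 0.0], [1.0, 1.0]]
frames = [[0.1, -0.1], [0.9, 1.2], [1.1, 0.8]]
vocabulary = ["good", "bad"]

bag = crossmodal_bag(frames, codebook, "Good good not bad", vocabulary)
print(bag)  # [1, 2, 2, 1]
```

In practice the codebook would be learned from training data (for example by clustering) rather than fixed by hand, and the resulting histograms are often normalised before classification.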

Original language: English
Pages (from-to): 1-5
Number of pages: 5
Journal: Journal of Machine Learning Research
Volume: 18
State: Published - 1 Oct 2017
Externally published: Yes

Keywords

  • Bag-of-words
  • Feature learning
  • Histogram feature representations
  • Multimodal signal processing
