A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge

Björn Schuller, Martin Zobl, Gerhard Rigoll, Manfred Lang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Recently an increasing interest in music retrieval can be observed. Due to the growing amount of online and offline available music and a broadening user spectrum more efficient query methods are needed. We believe that only a parallel multimodal combination of different input modalities forms the most intuitive way to access desired media for any user. In this paper we introduce a query by humming, speaking, writing, and typing. The strengths of each modality are combined in a synergetic manner by a soft decision fusion. Songs can be referenced by their according melody, artist, title or other specific information. Further more the recognition of the actual user's emotion and external contextual knowledge helps to build an expectance of the intended song at a time. This constrains the hypothesis sphere of possible songs and leads to a more robust recognition or even a suggestive query. A combination of artificial neural networks, hidden Markov models and dynamic time warping integrated in a Bayesian belief network framework build the mathematical background of the chosen hybrid architecture. We address the implementation of a working system and results achieved by the introduced methods.

Original languageEnglish
Title of host publicationProceedings - 2003 International Conference on Multimedia and Expo, ICME
PublisherIEEE Computer Society
Pages57-60
Number of pages4
ISBN (Electronic)0780379659
DOIs
StatePublished - 2003
Event2003 International Conference on Multimedia and Expo, ICME 2003 - Baltimore, United States
Duration: 6 Jul 20039 Jul 2003

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
Volume1
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2003 International Conference on Multimedia and Expo, ICME 2003
Country/TerritoryUnited States
CityBaltimore
Period6/07/039/07/03

Fingerprint

Dive into the research topics of 'A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge'. Together they form a unique fingerprint.

Cite this