Applying Bayesian belief networks in approximate string matching for robust keyword-based retrieval

Björn Schuller, Ronald Müller, Gerhard Rigoll, Manfred Lang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

In this work we present a novel approach towards robust keyword-based retrieval. Thereby Bayesian Belief Networks are applied in a word-model based Approximate String Matching algorithm. Apart from proved reliable performance of a working implementation on standard sources like digital text, wholly probabilistic modeling allows for integration of confidence measures and hypotheses obtained from preprocessing stages like handwriting recognition or optical character recognition respecting uncertainties on the lower levels. Furthermore a flexible method to include the modeling of specific error types deriving from humans and various input sources is provided. The remarkable performance of the algorithms presented was tested during extensive evaluation with respect to Levenstein-Distance, which can be seen as basis of state-of-the-art methods in this research field. The tests ran on a 14K database containing common international music titles and four 10K databases consisting of the most frequently used words in English, German, French, and Dutch language.

Original languageEnglish
Title of host publication2004 IEEE International Conference on Multimedia and Expo (ICME)
Pages1999-2002
Number of pages4
StatePublished - 2004
Event2004 IEEE International Conference on Multimedia and Expo (ICME) - Taipei, Taiwan, Province of China
Duration: 27 Jun 200430 Jun 2004

Publication series

Name2004 IEEE International Conference on Multimedia and Expo (ICME)
Volume3

Conference

Conference2004 IEEE International Conference on Multimedia and Expo (ICME)
Country/TerritoryTaiwan, Province of China
CityTaipei
Period27/06/0430/06/04

Fingerprint

Dive into the research topics of 'Applying Bayesian belief networks in approximate string matching for robust keyword-based retrieval'. Together they form a unique fingerprint.

Cite this