Pseudo 2-dimensional hidden Markov models in speech recognition

S. Werner, G. Rigoll

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

In this paper, the usage of pseudo 2-dimensional hidden Markov models for speech recognition is discussed. This image processing method should better model the time-frequency structure in speech signals. The method calculates the emission probability of a standard HMM by embedded HMM for each state. If a temporal sequence of spectral vectors is imagined as a spectrogram, this leads to a 2-dimensional warping of the spectrogram. This additional warping of the frequency axis could be useful for speaker-independent recognition and can be considered to be similar to a vocal tract normalization. The effects of this paradigm are investigated in this paper using the TI-Digits database.

Original languageEnglish
Title of host publication2001 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001 - Conference Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages441-444
Number of pages4
ISBN (Electronic)078037343X, 9780780373433
DOIs
StatePublished - 2001
Externally publishedYes
EventIEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001 - Madonna di Campiglio, Italy
Duration: 9 Dec 200113 Dec 2001

Publication series

Name2001 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001 - Conference Proceedings

Conference

ConferenceIEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001
Country/TerritoryItaly
CityMadonna di Campiglio
Period9/12/0113/12/01

Fingerprint

Dive into the research topics of 'Pseudo 2-dimensional hidden Markov models in speech recognition'. Together they form a unique fingerprint.

Cite this