Tied posteriors: An approach for effective introduction of context dependency in hybrid NN/HMM LVCSR

Jörg Rottland, Gerhard Rigoll

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

19 Scopus citations

Abstract

This paper presents a method to improve the recognition rate of hybrid connectionist/HMM speech recognition systems. At the same time this approach allows the easy introduction of context dependent models in the hybrid framework. The approach is based on a standard hybrid connectionist/HMM recognizer, in which the neural nets are trained to estimate the a posteriori probabilities for all phones in each input frame. In the approach presented here, the probabilities of the neural nets are used to replace the codebook of a tied-mixture HMM system. Therefore the resulting system is called tied posterior. The advantages of this structure are that an arbitrary HMM-topology can be used, and that all context dependency and all clustering techniques used in tied-mixture systems can be applied to this hybrid speech recognition system. The approach has been evaluated on the Wall Street Journal (WSJ) database, with the result, that it outperforms the standard hybrid approach on this task.

Original languageEnglish
Title of host publicationSpeech Processing II
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1241-1244
Number of pages4
ISBN (Electronic)0780362934
DOIs
StatePublished - 2000
Externally publishedYes
Event25th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000 - Istanbul, Turkey
Duration: 5 Jun 20009 Jun 2000

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume3
ISSN (Print)1520-6149

Conference

Conference25th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000
Country/TerritoryTurkey
CityIstanbul
Period5/06/009/06/00

Fingerprint

Dive into the research topics of 'Tied posteriors: An approach for effective introduction of context dependency in hybrid NN/HMM LVCSR'. Together they form a unique fingerprint.

Cite this