Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition

Jan Stadermann, Gerhard Rigoll

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Distributed speech recognition (DSR) where the recognizer is split up into two parts and connected via a transmission channel offers new perspectives for improving the speech recognition performance in mobile environments. In this work, we present the integration of hybrid acoustic models using tied-posteriors in a distributed environment. A comparison with standard Gaussian models is performed on the AURORA2 task and the WSJ0 task. Word-based HMMs and phoneme-based HMMs are trained for distributed and non-distributed recognition using either MFCC or RASTA-PLP features. The results show that hybrid modeling techniques can outperform standard continuous systems on this task. Especially the tied-posteriors approach is shown to be usable for DSR in a very flexible way since the client can be modified without a change at the server site and vice versa.

Original languageEnglish
Pages (from-to)1037-1046
Number of pages10
JournalSpeech Communication
Volume48
Issue number8
DOIs
StatePublished - Aug 2006

Keywords

  • Distributed speech recognition
  • Hybrid speech recognition
  • Tied-posteriors

Fingerprint

Dive into the research topics of 'Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition'. Together they form a unique fingerprint.

Cite this