Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition

Jan Stadermann, Gerhard Rigoll

Publikation: Beitrag in FachzeitschriftArtikelBegutachtung

3 Zitate (Scopus)

Abstract

Distributed speech recognition (DSR) where the recognizer is split up into two parts and connected via a transmission channel offers new perspectives for improving the speech recognition performance in mobile environments. In this work, we present the integration of hybrid acoustic models using tied-posteriors in a distributed environment. A comparison with standard Gaussian models is performed on the AURORA2 task and the WSJ0 task. Word-based HMMs and phoneme-based HMMs are trained for distributed and non-distributed recognition using either MFCC or RASTA-PLP features. The results show that hybrid modeling techniques can outperform standard continuous systems on this task. Especially the tied-posteriors approach is shown to be usable for DSR in a very flexible way since the client can be modified without a change at the server site and vice versa.

OriginalspracheEnglisch
Seiten (von - bis)1037-1046
Seitenumfang10
FachzeitschriftSpeech Communication
Jahrgang48
Ausgabenummer8
DOIs
PublikationsstatusVeröffentlicht - Aug. 2006

Fingerprint

Untersuchen Sie die Forschungsthemen von „Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren