Abstract
This paper describes a novel approach to speaker adaptation. The work was carried out by the author while he was a visiting scientist at the IBM Thomas Watson Research Center in Yorktown Heights, USA. The purpose of the research was to train the IBM speech recognition system with only five minutes of speech and to obtain a recognition rate of at least 95% after adaptation on a 5000-word vocabulary recognition task. The adaptation algorithm is based on an information theory approach that estimates the label stream of the new speaker using a stochastic model of the spectral differences between the new speaker and a reference speaker. In an evaluation in which twelve speakers were tested in ordinary 20-minute speaker-dependent training mode, the average recognition rate on a 5000-word vocabulary task was 96.4%. When the speakers were tested in 5-minute adaptation mode, the recognition rate dropped to 95.2%. A very important point is that the average decoding time increased by a factor of only 1.35, whereas this factor is often 3 to 5 when other adaptation algorithms are used.
| Original language | English |
|---|---|
| Pages | 1494-1497 |
| Number of pages | 4 |
| State | Published - 1989 |
| Externally published | Yes |
| Event | 1st European Conference on Speech Communication and Technology, EUROSPEECH 1989, Paris, France; duration: 27 Sep 1989 → 29 Sep 1989 |
Conference
| Conference | 1st European Conference on Speech Communication and Technology, EUROSPEECH 1989 |
|---|---|
| Country/Territory | France |
| City | Paris |
| Period | 27/09/89 → 29/09/89 |
Title: An Information Theory Approach to Speaker Adaptation