Title :
Phonetic typewriter based on phoneme source modeling
Author :
Yamada, Tomokazu ; Hanazawa, Toshiyuki ; Kawabata, Takeshi ; Matsunaga, Shinichiro ; Shikano, Kiyohiro
Author_Institution :
NTT Human Interface Lab., Tokyo, Japan
Abstract :
A phonetic typewriter that utilizes the underlying statistical structure of phoneme/character sequences is described. The syllable/character trigram approach to language modeling is adopted to make language source models. These are obtained by calculating trigram probabilities, using a large text database. The phonetic typewriter is tested using 279 phrases uttered by one male speaker, and the syllable source model achieves a 94.9% phoneme recognition rate with the test-set phoneme perplexity of 3.9. Without the syllable trigram, the phoneme recognition rate is only 73.2%. A trigram model based on characters is also evaluated. This model can reduce the phoneme perplexity significantly compared with that of the syllable trigram
Keywords :
speech recognition; typewriters; voice equipment; automatic speech recognition; character sequences; character trigram; language modeling; language source models; phoneme recognition rate; phoneme source modeling; phonetic typewriter; syllable trigram; test-set phoneme perplexity; text database; trigram model; trigram probabilities; Cepstral analysis; Databases; Hidden Markov models; Humans; Laboratories; Linear predictive coding; Predictive models; Probability; Speech analysis; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150304