DocumentCode
2280386
Title
Statistical learning of language pronunciation structure
Author
Korkmazskiy, Filipp
Author_Institution
Multimedia Commun. Res. Lab., Lucent Technol. Bell Labs., Murray Hill, NJ, USA
fYear
2001
fDate
2001
Firstpage
339
Lastpage
342
Abstract
This paper presents a new approach to rule based pronunciation generation. The system presented can automatically learn a new language pronunciation structure and use this knowledge for pronunciation generation for an arbitrary context sensitive language. Unlike conventional text-to-speech systems which are based on the cost expensive human expert knowledge about a specific language, this system can learn by using only a set of spellings and pronunciations. The pronunciations can be obtained either from a pronunciation dictionary or from a phonetically labeled database. The system´s ability to learn the pronunciation structure for any context sensitive language makes it a valuable tool for development of multilingual speech recognition systems. We present experimental results on automatic generation of pronunciations for English, German, Spanish, French and Italian.
Keywords
knowledge based systems; learning (artificial intelligence); linguistics; natural languages; speech recognition; statistical analysis; context sensitive language; language pronunciation structure; multilingual speech recognition; phonetically labeled database; pronunciation dictionary; rule based pronunciation generation; spellings; statistical learning; text-to-speech systems; Costs; Databases; Decision trees; Dictionaries; Humans; Multimedia communication; Natural languages; Speech recognition; Speech synthesis; Statistical learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN
0-7803-7343-X
Type
conf
DOI
10.1109/ASRU.2001.1034656
Filename
1034656
Link To Document