Title :
Phoneme-to-grapheme conversion for out-of-vocabulary words in large vocabulary speech recognition
Author :
Decadt, Bart ; Duchateau, Jacques ; Daelemans, Walter ; Wambacq, Patrick
Author_Institution :
CNTS Language Technol. Group, Antwerp Univ., Belgium
Abstract :
We describe a method to enhance the readability of the textual output in a large vocabulary continuous speech recognition system when out-of-vocabulary words occur. The basic idea is to replace uncertain words in the transcriptions with a phoneme recognition result that is post-processed using a phoneme-to-grapheme converter. This converter turns phoneme strings into grapheme strings and is trained using machine learning techniques. Experiments show that, even when the grapheme strings are not fully correct, the resulting transcriptions are more easily readable than the original ones.
Keywords :
acoustic signal processing; learning (artificial intelligence); speech recognition; text analysis; vocabulary; acoustic data; continuous speech recognition; grapheme string; machine learning techniques; out-of-vocabulary words; phoneme recognizer; phoneme-to-grapheme conversion; textual output; Automatic speech recognition; Broadcasting; Electronic mail; Machine learning; Natural languages; Paper technology; Speech processing; Speech recognition; Text recognition; Vocabulary;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034672