DocumentCode :
2768843
Title :
Adapting grapheme-to-phoneme conversion for name recognition
Author :
Li, Xiao ; Gunawardana, Asela ; Acero, Alex
Author_Institution :
Microsoft Res., Redmond
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
130
Lastpage :
135
Abstract :
This work investigates the use of acoustic data to improve grapheme-to-phoneme conversion for name recognition. We introduce a joint model of acoustics and graphonemes, and present two approaches, maximum likelihood training and discriminative training, in adapting graphoneme model parameters. Experiments on a large-scale voice-dialing system show that the maximum likelihood approach yields a relative 7% reduction in SER compared to the best baseline result we obtained without leveraging acoustic data, while discriminative training enlarges the SER reduction to 12%.
Keywords :
audio signal processing; character recognition; maximum likelihood estimation; speech recognition; discriminative training; grapheme-to-phoneme conversion; large-scale voice-dialing system; maximum likelihood training; name recognition; Acoustics; Adaptation model; Large-scale systems; Merging; Natural languages; Random variables; Speech recognition; Target recognition; discriminative training; grapheme-to-phoneme conversion; name recognition; pronunciation model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430097
Filename :
4430097
Link To Document :
بازگشت