DocumentCode :
3670649
Title :
On transcribing informally-pronounced numbers in Romanian speech
Author :
Horia Cucu;Alexandru Caranica;Andi Buzo;Corneliu Burileanu
Author_Institution :
Speech and Dialogue Research Laboratory, University Politehnica of Bucharest, Romania
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
372
Lastpage :
376
Abstract :
The pronunciation model, a mapping between lexicon words and their phonetic representations, plays a key role in automatic speech recognition. Although often neglected, the accuracy of this model significantly influences the accuracy of the whole system. This study discusses within-word and cross-word pronunciation variations for Romanian numbers and proposes solutions to model them in the phonetic dictionary and the language model of an existing speech recognition system for Romanian. The evaluation is performed on a read speech corpus comprising rational numbers with up to three decimal digits. The experiments show a relative WER improvement of 14% over the baseline when within-word pronunciation variations are taken into account, and an additional relative WER improvement of 63% when cross-word pronunciation variations are also modelled.
Keywords :
"Speech","Speech recognition","Hidden Markov models","Grammar","Dictionaries","Context","Decoding"
Publisher :
IEEE
Conference_Titel :
Telecommunications and Signal Processing (TSP), 2015 38th International Conference on
Type :
conf
DOI :
10.1109/TSP.2015.7296286
Filename :
7296286