DocumentCode
2970665
Title
Pronunciation modeling for dialectal Arabic speech recognition
Author
Al-Haj, Hassan ; Hsiao, Roger ; Lane, Ian ; Black, Alan W. ; Waibel, Alex
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2009
fDate
Dec. 13 2009-Dec. 17 2009
Firstpage
525
Lastpage
528
Abstract
Short vowels in Arabic are normally omitted in written text, which leads to ambiguity in pronunciation. This is even more pronounced for dialectal Arabic, where a single word can be pronounced quite differently depending on the speaker's nationality, level of education, social class and religion. In this paper we focus on pronunciation modeling for Iraqi-Arabic speech. We introduce multiple pronunciations into the Iraqi speech recognition lexicon and compare the performance when weights computed via forced alignment are assigned to the different pronunciations of a word. Incorporating multiple pronunciations improved recognition accuracy compared to a single-pronunciation baseline, and introducing pronunciation weights further improved performance. Using these techniques, an absolute reduction in word-error-rate of 2.4% was obtained compared to the baseline system.
Keywords
linguistics; natural language processing; speech recognition; word processing; Iraqi-Arabic speech; dialectal speech recognition; education level; pronunciation modeling; pronunciation weights; short vowels; speaker nationality; speaker religion; speaker social class; word-error rate; written text; Automatic speech recognition; Books; Computer science; Context; Decoding; Dictionaries; Government; Natural languages; Predictive models; Speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5373245
Filename
5373245