DocumentCode
3410873
Title
Phonetic pronunciations for arabic speech-to-text systems
Author
Diehl, F. ; Gales, M.J.F. ; Tomalin, M. ; Woodland, P.C.
Author_Institution
Eng. Dept., Cambridge Univ., Cambridge
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
1573
Lastpage
1576
Abstract
In this paper two aspects of generating and using phonetic arabic dictionaries are described. First, the use of single pronunciation acoustic models in the context of arabic large vocabulary automatic speech recognition (ASR) is investigated. These have been found to be useful for English ASR systems, when combined with standard multiple pronunciation systems. The second area examined is automatically deriving phonetic "pronunciations" for words that standard approaches, such as the Buckwalter morphological analyzer, cannot handle. Without pronunciations for these words the OOV rates for various Arabic tasks significantly increase. Here, pronunciations are automatically found by first deriving grapheme-to-phone rules, and associated rule probabilities. These are then used to produce the most likely pronunciation, or pronunciations, for any word. These approaches are evaluated on a large vocabulary arabic broadcast news and broadcast conversation transcription task. Both schemes are found to yield gains with a multi-pass/combination framework.
Keywords
speech processing; speech recognition; speech synthesis; Arabic large vocabulary; Arabic speech-to-text systems; automatic speech recognition; multiple pronunciation systems; phonetic Arabic dictionaries; phonetic pronunciations; single pronunciation acoustic models; Acoustical engineering; Automatic speech recognition; Broadcasting; Context modeling; Dictionaries; Frequency; Hidden Markov models; Speech recognition; Training data; Vocabulary; Arabic; Single Pronunciation Modelling; Speech Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4517924
Filename
4517924
Link To Document