DocumentCode
3693886
Title
Effect of language resources on automatic speech recognition for Amharic
Author
Martha Yifiru Tachbelie;Solomon Teferra Abate
Author_Institution
School of Information Science, College of Natural Science, Addis Ababa University (AAU), Addis Ababa, Ethiopia
fYear
2015
Firstpage
1
Lastpage
5
Abstract
This paper presents our investigation of the effect of language resources on the performance of Amharic speech recognition. We have used language model training text of different sizes and seen the effect on word error rate (WER) reduction. Moreover, we have investigated the effect of handling language issues (germination, epenthetic vowel insertion and glottal stop consonant pronunciation) on the performance of speech recognition systems using data-driven phone-level transcriptions. The results of our experiments show that only slight reduction in WER can be obtained by increasing language model training text. However, proper transcription of gemination, the epenthetic vowel and the glottal stop consonant did not bring performance improvement for Amharic speech recognition. This can be attributed to the larger number of phone HMM acoustic models (62 compared to 37 phone set of the grapheme-based phone-level transcriptions) trained with a small (5 hrs) training speech.
Keywords
"Speech","Hidden Markov models","Training","Acoustics","Dictionaries","Automatic speech recognition"
Publisher
ieee
Conference_Titel
AFRICON, 2015
Electronic_ISBN
2153-0033
Type
conf
DOI
10.1109/AFRCON.2015.7331871
Filename
7331871
Link To Document