DocumentCode :
3693886
Title :
Effect of language resources on automatic speech recognition for Amharic
Author :
Martha Yifiru Tachbelie;Solomon Teferra Abate
Author_Institution :
School of Information Science, College of Natural Science, Addis Ababa University (AAU), Addis Ababa, Ethiopia
fYear :
2015
Firstpage :
1
Lastpage :
5
Abstract :
This paper presents our investigation of the effect of language resources on the performance of Amharic speech recognition. We have used language model training text of different sizes and seen the effect on word error rate (WER) reduction. Moreover, we have investigated the effect of handling language issues (germination, epenthetic vowel insertion and glottal stop consonant pronunciation) on the performance of speech recognition systems using data-driven phone-level transcriptions. The results of our experiments show that only slight reduction in WER can be obtained by increasing language model training text. However, proper transcription of gemination, the epenthetic vowel and the glottal stop consonant did not bring performance improvement for Amharic speech recognition. This can be attributed to the larger number of phone HMM acoustic models (62 compared to 37 phone set of the grapheme-based phone-level transcriptions) trained with a small (5 hrs) training speech.
Keywords :
"Speech","Hidden Markov models","Training","Acoustics","Dictionaries","Automatic speech recognition"
Publisher :
ieee
Conference_Titel :
AFRICON, 2015
Electronic_ISBN :
2153-0033
Type :
conf
DOI :
10.1109/AFRCON.2015.7331871
Filename :
7331871
Link To Document :
بازگشت