Effect of language resources on automatic speech recognition for Amharic

Author

Martha Yifiru Tachbelie;Solomon Teferra Abate

Author_Institution

School of Information Science, College of Natural Science, Addis Ababa University (AAU), Addis Ababa, Ethiopia

fYear

2015

Firstpage

1

Lastpage

5

Abstract

This paper presents our investigation of the effect of language resources on the performance of Amharic speech recognition. We have used language model training text of different sizes and seen the effect on word error rate (WER) reduction. Moreover, we have investigated the effect of handling language issues (germination, epenthetic vowel insertion and glottal stop consonant pronunciation) on the performance of speech recognition systems using data-driven phone-level transcriptions. The results of our experiments show that only slight reduction in WER can be obtained by increasing language model training text. However, proper transcription of gemination, the epenthetic vowel and the glottal stop consonant did not bring performance improvement for Amharic speech recognition. This can be attributed to the larger number of phone HMM acoustic models (62 compared to 37 phone set of the grapheme-based phone-level transcriptions) trained with a small (5 hrs) training speech.

Keywords

"Speech","Hidden Markov models","Training","Acoustics","Dictionaries","Automatic speech recognition"

Publisher

ieee

Conference_Titel

AFRICON, 2015

Electronic_ISBN

2153-0033

Type

conf

DOI

10.1109/AFRCON.2015.7331871

Filename

7331871