• DocumentCode
    3693886
  • Title

    Effect of language resources on automatic speech recognition for Amharic

  • Author

    Martha Yifiru Tachbelie;Solomon Teferra Abate

  • Author_Institution
    School of Information Science, College of Natural Science, Addis Ababa University (AAU), Addis Ababa, Ethiopia
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper presents our investigation of the effect of language resources on the performance of Amharic speech recognition. We have used language model training text of different sizes and seen the effect on word error rate (WER) reduction. Moreover, we have investigated the effect of handling language issues (germination, epenthetic vowel insertion and glottal stop consonant pronunciation) on the performance of speech recognition systems using data-driven phone-level transcriptions. The results of our experiments show that only slight reduction in WER can be obtained by increasing language model training text. However, proper transcription of gemination, the epenthetic vowel and the glottal stop consonant did not bring performance improvement for Amharic speech recognition. This can be attributed to the larger number of phone HMM acoustic models (62 compared to 37 phone set of the grapheme-based phone-level transcriptions) trained with a small (5 hrs) training speech.
  • Keywords
    "Speech","Hidden Markov models","Training","Acoustics","Dictionaries","Automatic speech recognition"
  • Publisher
    ieee
  • Conference_Titel
    AFRICON, 2015
  • Electronic_ISBN
    2153-0033
  • Type

    conf

  • DOI
    10.1109/AFRCON.2015.7331871
  • Filename
    7331871