• DocumentCode
    3428181
  • Title

    A new approach to develop a syllable based, continuous Amharic speech recognizer

  • Author

    Gebremedhin, Yitagessu Birhanu ; Duckhorn, Frank ; Hoffmann, Raik ; Kraljevski, Ivan

  • Author_Institution
    Dept. of Syst. Theor. & Speech Technol., Dresden Univ. of Technol., Dresden, Germany
  • fYear
    2013
  • fDate
    1-4 July 2013
  • Firstpage
    1684
  • Lastpage
    1689
  • Abstract
    All of the previous syllable based Automatic Speech Recognizers (ASRs) for the Amharic language are built by training a separate acoustic model for each of the 196 distinctly pronounced Consonant-Vowel (CV) syllable. In this paper, we will demonstrate that a smaller number of acoustic models are sufficient to build a syllable based, speaker independent, continuous, Amharic ASR. It is built for weather forecast and business report applications using the UASR (Unified Approach to Speech Synthesis and Recognition) Tool kit. A new speech corpus, which is of more than 35 hours duration, is used for training. It is a collection of corpora recorded in three different environments in order to make the recognizer less sensitive to recording environment and microphone changes. The grammar is finite state transducer based and the lexical model consists of thousands of words. Though acoustic models for only 93 syllables are trained, a recognition accuracy of 93.26% is achieved on a test set that has 4,000 words collected from 10 speakers.
  • Keywords
    natural language processing; speech recognition; speech synthesis; Amharic ASR; Amharic language; CV syllable; UASR; acoustic models; automatic speech recognizers; business report applications; consonant vowel syllable; continuous Amharic speech recognizer; finite state transducer; grammar; microphone changes; recording environment; speaker independent; speech corpus; syllable based; unified approach to speech synthesis and recognition; weather forecast; Accuracy; Acoustics; Grammar; Hidden Markov models; Speech; Speech recognition; Training; ASR; Amharic; CV-syllable; Finite State Transducers; UASR;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    EUROCON, 2013 IEEE
  • Conference_Location
    Zagreb
  • Print_ISBN
    978-1-4673-2230-0
  • Type

    conf

  • DOI
    10.1109/EUROCON.2013.6625203
  • Filename
    6625203