• DocumentCode
    3423280
  • Title

    Modified polyphone decision tree specialization for porting multilingual Grapheme based ASR systems to new languages

  • Author

    Stüker, Sebastian

  • Author_Institution
    Inst. fur Theor. Inf., Univ. Karlsruhe, Karlsruhe
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4249
  • Lastpage
    4252
  • Abstract
    Automatic speech recognition (ASR) systems have been developed only for a very limited number of the estimated 7,000 languages in the world. In order to avoid the evolvement of a digital divide between languages for which ASR systems exist and those without one, it is necessary to be able to rapidly create ASR systems for new languages in a cost efficient way. Grapheme based systems, which eliminate the costly need for a pronunciation dictionary, have been shown to work for a variety of languages. They are thus destined for porting ASR systems to new languages. This paper studies the use of multilingual grapheme based models for rapidly bootstrapping acoustic models in new languages. The cross language performance of a standard, multilingual (ML) acoustic model on a new language is improved by introducing a new, modified version of polyphone decision tree specialization that improves the performance of the ML models by up to 15.5% relative.
  • Keywords
    acoustic signal processing; decision trees; speech recognition; ASR systems; automatic speech recognition; modified polyphone decision tree; multilingual acoustic model; multilingual grapheme models; porting multilingual grapheme; pronunciation dictionary; Acceleration; Automatic speech recognition; Costs; Cultural differences; Decision trees; Dictionaries; Natural languages; Speech recognition; Switches; Vocabulary; Automatic Speech Recognition; Grapheme based acoustic models; Multilingual ASR; Rapid Porting of ASR systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518593
  • Filename
    4518593