• DocumentCode
    294612
  • Title

    Using morphology towards better large-vocabulary speech recognition systems

  • Author

    Geutner, P.

  • Author_Institution
    Dept. of Comput. Sci., Karlsruhe Univ., Germany
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    445
  • Abstract
    To guarantee unrestricted natural language processing, state-of-the-art speech recognition systems require huge dictionaries that increase search space and result in performance degradations. This is especially true for languages where there do exist a large number of inflections and compound words such as German, Spanish, etc. One way to keep up decent recognition results with increasing vocabulary is the use of other base units than simply words. Different decomposition methods originally based on morphological decomposition for the German language are compared. Not only do they counteract the immense vocabulary growth with an increasing amount of training data, also the rate of out-of-vocabulary words, which worsens recognition performance significantly in German, is decreased. A smaller dictionary also leads to 30 K speed improvement during the recognition process. Moreover even if the amount of available training data is quite huge it is often not enough to guarantee robust language model estimations, whereas morphem-based models are capable to do so
  • Keywords
    mathematical morphology; natural languages; speech processing; speech recognition; German language; Spanish; continuous speech recognition; decomposition methods; dictionaries; large-vocabulary speech recognition systems; morphem-based models; morphological decomposition; natural language processing; out-of-vocabulary words; performance degradations; recognition performance; robust language model estimations; search space; training data; Acoustic testing; Databases; Degradation; Dictionaries; Materials testing; Morphology; Natural languages; Robustness; Speech recognition; Testing; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479624
  • Filename
    479624