• DocumentCode
    2789650
  • Title

    Morphological and syntactic features for Arabic speech recognition

  • Author

    Kuo, Hong-Kwang Jeff ; Mangu, Lidia ; Emami, Ahmad ; Zitouni, Imed

  • Author_Institution
    IBM T.J.Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5190
  • Lastpage
    5193
  • Abstract
    In this paper, we study the use of morphological and syntactic context features to improve speech recognition of a morphologically rich language like Arabic. We examine a variety of syntactic features, including part-of-speech tags, shallow parse tags, and exposed head words and their non-terminal labels both before and after the word to be predicted. Neural network LMs are used to model these features since they generalize better to unseen events by modeling words and other context features in continuous space. Using morphological and syntactic features, we can improve the word error rate (WER) significantly on various test sets, including EVAL´08U, the unsequestered portion of the DARPA GALE Phase 3 evaluation test set.
  • Keywords
    neural nets; speech recognition; Arabic speech recognition; exposed head words; morphological feature; neural network language model; part-of-speech tags; shallow parse tags; syntactic feature; word error rate; Context modeling; Error analysis; Feature extraction; Lattices; Morphology; Natural languages; Neural networks; Speech recognition; Testing; Vocabulary; Syntax; morphology; neural network language model; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495010
  • Filename
    5495010