• DocumentCode
    2660030
  • Title

    A syntactic language model based on incremental CCG parsing

  • Author

    Hassan, Hany ; Sima´an, K. ; Way, Andy

  • Author_Institution
    Sch. of Comput., Dublin City Univ., Dublin
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    205
  • Lastpage
    208
  • Abstract
    Syntactically-enriched language models (parsers) constitute a promising component in applications such as machine translation and speech-recognition. To maintain a useful level of accuracy, existing parsers are non-incremental and must span a combinatorially growing space of possible structures as every input word is processed. This prohibits their incorporation into standard linear-time decoders. In this paper, we present an incremental, linear-time dependency parser based on Combinatory Categorial Grammar (CCG) and classification techniques. We devise a deterministic transform of CCG-bank canonical derivations into incremental ones, and train our parser on this data. We discover that a cascaded, incremental version provides an appealing balance between efficiency and accuracy.
  • Keywords
    language translation; speech recognition; CCG-bank canonical derivations; Combinatory Categorial Grammar; incremental CCG parsing; machine translation; parsers; speech-recognition; standard linear-time decoders; syntactic language model; Assembly; Context modeling; Decoding; Delay; Large-scale systems; Natural languages; Probability; Speech recognition; Tin; Usability; Grammar; Language Modeling; Natural languages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
  • Conference_Location
    Goa
  • Print_ISBN
    978-1-4244-3471-8
  • Electronic_ISBN
    978-1-4244-3472-5
  • Type

    conf

  • DOI
    10.1109/SLT.2008.4777876
  • Filename
    4777876