• DocumentCode
    2289136
  • Title

    Corpus-based speech and language research in the Institute of Systems Science

  • Author

    Wu, Horng Jyh ; Guo, Jin ; Lui, Ho Chung ; Low, Hwee Boon

  • Author_Institution
    Inst. of Syst. Sci., Nat. Univ. of Singapore, Singapore
  • fYear
    1994
  • fDate
    13-16 Apr 1994
  • Firstpage
    142
  • Abstract
    This paper describes the ongoing and planned research projects on speech and language modeling in the Institute of Systems Science. Four main areas of work have been concentrated and targeted: (1) intonation unit modeling using prosodic features; (2) identification and acquisition of lexical compounds; (3) stochastic dependency grammar parsing; and (4) factual information extraction. These research topics cover full-range of issues from the speech prosody level to the language discourse level. None the less, one consistent theme hinges together requirements from these different levels of processing-that is the so called corpus-based statistical approach. As revealed to us by applying this approach to various application systems, two related characteristics of a practical natural language processing (NLP) system emerge as rather crucial: (1) to prepare a high quality and large amount of tagged corpora as training examples; (2) to identify of a set of tag features most relevant to an application domain
  • Keywords
    grammars; natural languages; research initiatives; speech analysis and processing; statistical analysis; stochastic processes; Institute of Systems Science; corpus-based statistical approach; factual information extraction; intonation unit modeling; language discourse level; language research; lexical compounds acquisition; lexical compounds identification; natural language processing system; prosodic features; speech processing; speech prosody level; stochastic dependency grammar parsing; training examples; Data mining; Engines; Fasteners; Natural language processing; Natural languages; Recruitment; Speech processing; Statistical analysis; Stochastic processes; Tagging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
  • Print_ISBN
    0-7803-1865-X
  • Type

    conf

  • DOI
    10.1109/SIPNN.1994.344946
  • Filename
    344946