DocumentCode
2289136
Title
Corpus-based speech and language research in the Institute of Systems Science
Author
Wu, Horng Jyh ; Guo, Jin ; Lui, Ho Chung ; Low, Hwee Boon
Author_Institution
Inst. of Syst. Sci., Nat. Univ. of Singapore, Singapore
fYear
1994
fDate
13-16 Apr 1994
Firstpage
142
Abstract
This paper describes ongoing and planned research projects on speech and language modeling at the Institute of Systems Science. Work concentrates on four main areas: (1) intonation unit modeling using prosodic features; (2) identification and acquisition of lexical compounds; (3) stochastic dependency grammar parsing; and (4) factual information extraction. These topics cover the full range of issues from the speech prosody level to the language discourse level. Nonetheless, one consistent theme ties together the requirements of these different levels of processing: the so-called corpus-based statistical approach. Applying this approach to various application systems has shown that two related requirements of a practical natural language processing (NLP) system are crucial: (1) preparing a large quantity of high-quality tagged corpora as training examples; and (2) identifying the set of tag features most relevant to an application domain.
Keywords
grammars; natural languages; research initiatives; speech analysis and processing; statistical analysis; stochastic processes; Institute of Systems Science; corpus-based statistical approach; factual information extraction; intonation unit modeling; language discourse level; language research; lexical compounds acquisition; lexical compounds identification; natural language processing system; prosodic features; speech processing; speech prosody level; stochastic dependency grammar parsing; training examples; Data mining; Engines; Fasteners; Natural language processing; Natural languages; Recruitment; Speech processing; Statistical analysis; Stochastic processes; Tagging;
fLanguage
English
Publisher
IEEE
Conference_Titel
Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
Print_ISBN
0-7803-1865-X
Type
conf
DOI
10.1109/SIPNN.1994.344946
Filename
344946