DocumentCode :
399562
Title :
Event information extraction using link grammar
Author :
Madhyastha, Harsha V. ; Balakrishnan, N. ; Ramakrishnan, K.R.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Madras Chennai, India
fYear :
2003
fDate :
10-11 March 2003
Firstpage :
16
Lastpage :
22
Abstract :
In this paper, we present a scheme for identifying instances of events and extracting information about them. The scheme can handle all events with which an action can be associated, which covers most types of events. Our system basically tries to extract semantic information from the syntactic structure given by the link grammar system described by D. Sleator and D. Temperly (1991) to any English sentence. The instances of events are identified by finding all sentences in the text where the verb, which best represents the action in the event, or one of its synonyms/hyponyms occurs as a main verb. Then, information about that instance of the event is derived using a set of rules which we have developed to identify the subject and object as well as the modifiers of all verbs and nouns in any English sentence, making use of the structure given by the link parser. The scheme was tested on the Reuters corpus and gave recall and precision even up to 100%.
Keywords :
grammars; information retrieval; natural languages; text analysis; English sentence; Reuters corpus; event information extraction; event instance identification; hyponyms; link grammar; link parser; noun modifiers; object identification; semantic information extraction; subject identification; synonyms; syntactic structure; verb modifiers; Computer science; Data mining; Hidden Markov models; Information filtering; Information filters; Internet; Natural languages; Radio access networks; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Research Issues in Data Engineering: Multi-lingual Information Management, 2003. RIDE-MLIM 2003. Proceedings. 13th International Workshop on
ISSN :
1066-1395
Print_ISBN :
0-7803-7868-7
Type :
conf
DOI :
10.1109/RIDE.2003.1249841
Filename :
1249841
Link To Document :
بازگشت