Title :
Automated dictionary construction for information extraction from text
Author :
Riloff, Ellen ; Lehnert, Wendy
Author_Institution :
Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
Abstract :
The authors have developed a tool called AutoSlog that automatically constructs domain-specific dictionaries given a set of annotated training texts. Using AutoSlog, a first-year graduate student who had minimal experience with the CIRCUS sentence analyzer on which AutoSlog is based, created a dictionary for the domain of terrorism in 8 hours. In the experiments, the 8-hour AutoSlog dictionary achieved 90% of the performance of a hand-crafted dictionary that required 1500 person-hours of effort by 2 advanced graduate students who were highly skilled with the sentence analyzer
Keywords :
character recognition; feature extraction; glossaries; word processing; AutoSlog; CIRCUS sentence analyzer; annotated training texts; automated dictionary construction; domain-specific dictionaries; information extraction; Computer science; Data mining; Dictionaries; Industrial training; Knowledge engineering; Natural language processing; Performance analysis; Scalability; Terrorism; Text analysis;
Conference_Titel :
Artificial Intelligence for Applications, 1993. Proceedings., Ninth Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-8186-3840-0
DOI :
10.1109/CAIA.1993.366656