Title :
Information extraction system in large-scale Web
Author :
Hong, Fei ; Zhao, Zhuang
Author_Institution :
Sch. of Comput. Sci. Coll. of Software Eng., BeiHang Univ., Beijing, China
Abstract :
Manually querying search engines in order to acquire a large body of related information is a tedious, error-prone process. Search engines retrieve and rank potentially relevant documents for human perusal, but do not extract facts, assess confidence, or fuse information from multiple documents. This paper present an information extraction system that aims to automate the tedious process of extracting large collections of facts from large-scale, domain-independent, and scalable manner. The paper focus on four major components: search engine interface, extractor, assessor, database, and further analyzes system architecture and reports on simulation results with large-scale information extraction systems.
Keywords :
Internet; search engines; information extraction system; large-scale Web; search engine interface; Analytical models; Data analysis; Data mining; Databases; Fuses; Humans; Information analysis; Information retrieval; Large-scale systems; Search engines;
Conference_Titel :
Communications and Information Technology, 2005. ISCIT 2005. IEEE International Symposium on
Print_ISBN :
0-7803-9538-7
DOI :
10.1109/ISCIT.2005.1566990