DocumentCode :
2879710
Title :
Information extraction system in large-scale Web
Author :
Hong, Fei ; Zhao, Zhuang
Author_Institution :
Sch. of Comput. Sci. Coll. of Software Eng., BeiHang Univ., Beijing, China
Volume :
2
fYear :
2005
fDate :
12-14 Oct. 2005
Firstpage :
809
Lastpage :
812
Abstract :
Manually querying search engines in order to acquire a large body of related information is a tedious, error-prone process. Search engines retrieve and rank potentially relevant documents for human perusal, but do not extract facts, assess confidence, or fuse information from multiple documents. This paper presents an information extraction system that aims to automate the tedious process of extracting large collections of facts from the large-scale Web in a domain-independent and scalable manner. The paper focuses on four major components: search engine interface, extractor, assessor, and database. It further analyzes the system architecture and reports simulation results with large-scale information extraction systems.
Keywords :
Internet; search engines; information extraction system; large-scale Web; search engine interface; Analytical models; Data analysis; Data mining; Databases; Fuses; Humans; Information analysis; Information retrieval; Large-scale systems; Search engines
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications and Information Technology, 2005. ISCIT 2005. IEEE International Symposium on
Print_ISBN :
0-7803-9538-7
Type :
conf
DOI :
10.1109/ISCIT.2005.1566990
Filename :
1566990