DocumentCode
2879710
Title
Information extraction system in large-scale Web
Author
Hong, Fei ; Zhao, Zhuang
Author_Institution
Sch. of Comput. Sci. Coll. of Software Eng., BeiHang Univ., Beijing, China
Volume
2
fYear
2005
fDate
12-14 Oct. 2005
Firstpage
809
Lastpage
812
Abstract
Manually querying search engines in order to acquire a large body of related information is a tedious, error-prone process. Search engines retrieve and rank potentially relevant documents for human perusal, but do not extract facts, assess confidence, or fuse information from multiple documents. This paper present an information extraction system that aims to automate the tedious process of extracting large collections of facts from large-scale, domain-independent, and scalable manner. The paper focus on four major components: search engine interface, extractor, assessor, database, and further analyzes system architecture and reports on simulation results with large-scale information extraction systems.
Keywords
Internet; search engines; information extraction system; large-scale Web; search engine interface; Analytical models; Data analysis; Data mining; Databases; Fuses; Humans; Information analysis; Information retrieval; Large-scale systems; Search engines;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications and Information Technology, 2005. ISCIT 2005. IEEE International Symposium on
Print_ISBN
0-7803-9538-7
Type
conf
DOI
10.1109/ISCIT.2005.1566990
Filename
1566990
Link To Document