Information extraction system in large-scale Web

Author

Hong, Fei ; Zhao, Zhuang

Author_Institution

Sch. of Comput. Sci. Coll. of Software Eng., BeiHang Univ., Beijing, China

Volume

fYear

2005

fDate

12-14 Oct. 2005

Firstpage

809

Lastpage

812

Abstract

Manually querying search engines in order to acquire a large body of related information is a tedious, error-prone process. Search engines retrieve and rank potentially relevant documents for human perusal, but do not extract facts, assess confidence, or fuse information from multiple documents. This paper present an information extraction system that aims to automate the tedious process of extracting large collections of facts from large-scale, domain-independent, and scalable manner. The paper focus on four major components: search engine interface, extractor, assessor, database, and further analyzes system architecture and reports on simulation results with large-scale information extraction systems.

Keywords

Internet; search engines; information extraction system; large-scale Web; search engine interface; Analytical models; Data analysis; Data mining; Databases; Fuses; Humans; Information analysis; Information retrieval; Large-scale systems; Search engines;

fLanguage

English

Publisher

ieee

Conference_Titel

Communications and Information Technology, 2005. ISCIT 2005. IEEE International Symposium on

Print_ISBN

0-7803-9538-7

Type

conf

DOI

10.1109/ISCIT.2005.1566990

Filename

1566990

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2879710