The Content Extraction Method of Webpage Information Based on Knowledge Base

Author

Chen, Guowei ; Zhang, Pengzhou

Author_Institution

MITI Lab., Commun. Univ. of China, Beijing, China

fYear

2012

fDate

23-26 June 2012

Firstpage

623

Lastpage

626

Abstract

Web content extraction is actually the process of transforming web unstructured information into structured information. Knowledge base has the advantages of ordering information and knowledge, also be used conveniently. So it´s convenient to retrieve information and knowledge, and it makes base for effective use. Knowledge base will speed up the knowledge and the flow of information and make for knowledge sharing and communication. This paper puts forward a web information extraction method which is based on the knowledge base. Experiment results show that the method has greatly increased efficiency and accuracy of the web information extraction.

Keywords

Internet; information retrieval; Web content extraction; Web information extraction; Webpage information; communication; knowledge base; knowledge sharing; unstructured information; Accuracy; Data mining; HTML; Internet; Knowledge based systems; Web pages; KA; PA; Semistructured Data; information extraction; knowledge base;

fLanguage

English

Publisher

ieee

Conference_Titel

Computational Sciences and Optimization (CSO), 2012 Fifth International Joint Conference on

Conference_Location

Harbin

Print_ISBN

978-1-4673-1365-0

Type

conf

DOI

10.1109/CSO.2012.142

Filename

6274803

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3063745