DocumentCode
3063745
Title
The Content Extraction Method of Webpage Information Based on Knowledge Base
Author
Chen, Guowei ; Zhang, Pengzhou
Author_Institution
MITI Lab., Commun. Univ. of China, Beijing, China
fYear
2012
fDate
23-26 June 2012
Firstpage
623
Lastpage
626
Abstract
Web content extraction is the process of transforming unstructured web information into structured information. A knowledge base organizes information and knowledge in an ordered form and is convenient to use, which makes information and knowledge easy to retrieve and lays the foundation for their effective use. A knowledge base also speeds up the flow of knowledge and information and facilitates knowledge sharing and communication. This paper proposes a web information extraction method based on a knowledge base. Experimental results show that the method greatly increases the efficiency and accuracy of web information extraction.
Keywords
Internet; information retrieval; Web content extraction; Web information extraction; Webpage information; communication; knowledge base; knowledge sharing; unstructured information; Accuracy; Data mining; HTML; Knowledge based systems; Web pages; KA; PA; Semistructured Data; information extraction
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Sciences and Optimization (CSO), 2012 Fifth International Joint Conference on
Conference_Location
Harbin
Print_ISBN
978-1-4673-1365-0
Type
conf
DOI
10.1109/CSO.2012.142
Filename
6274803
Link To Document