• DocumentCode
    3063745
  • Title

    The Content Extraction Method of Webpage Information Based on Knowledge Base

  • Author

    Chen, Guowei ; Zhang, Pengzhou

  • Author_Institution
    MITI Lab., Commun. Univ. of China, Beijing, China
  • fYear
    2012
  • fDate
    23-26 June 2012
  • Firstpage
    623
  • Lastpage
    626
  • Abstract
    Web content extraction is actually the process of transforming web unstructured information into structured information. Knowledge base has the advantages of ordering information and knowledge, also be used conveniently. So it´s convenient to retrieve information and knowledge, and it makes base for effective use. Knowledge base will speed up the knowledge and the flow of information and make for knowledge sharing and communication. This paper puts forward a web information extraction method which is based on the knowledge base. Experiment results show that the method has greatly increased efficiency and accuracy of the web information extraction.
  • Keywords
    Internet; information retrieval; Web content extraction; Web information extraction; Webpage information; communication; knowledge base; knowledge sharing; unstructured information; Accuracy; Data mining; HTML; Internet; Knowledge based systems; Web pages; KA; PA; Semistructured Data; information extraction; knowledge base;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Sciences and Optimization (CSO), 2012 Fifth International Joint Conference on
  • Conference_Location
    Harbin
  • Print_ISBN
    978-1-4673-1365-0
  • Type

    conf

  • DOI
    10.1109/CSO.2012.142
  • Filename
    6274803