• DocumentCode
    1601808
  • Title

    An automatic acquisition of domain knowledge from list-structured text in Baidu encyclopedia

  • Author

    Wenjuan Wu; Tao Liu; He Hu; Xiaoyong Du

  • Author_Institution
    Sch. of Inf., Renmin Univ. of China, Beijing, China
  • fYear
    2010
  • Firstpage
    291
  • Lastpage
    298
  • Abstract
    We propose a novel method that automatically extracts new concepts and the semantic relations between them, in order to support domain ontology evolvement. We collect the corpus from Baidu encyclopedia, a free Chinese encyclopedia similar to Wikipedia. We locate lists in the Baidu encyclopedia and extract domain knowledge from them. Furthermore, we use a knowledge assessor to ensure the validity of the extracted knowledge. In the experiments, we make a practical attempt to evolve the Chinese Law Ontology (CLO V0) and show that our method can improve the completeness and coverage of CLO V0.
  • Keywords
    encyclopaedias; information retrieval; ontologies (artificial intelligence); Baidu encyclopedia; Chinese encyclopedia; Chinese law ontology; Wikipedia; automatic acquisition; domain knowledge extraction; domain ontology evolvement; list-structured text; Data mining; Encyclopedias; HTML; Ontologies; Semantics; Web pages; World Wide Web; baidu encyclopedia; knowledge extraction; list wrapper
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Universal Communication Symposium (IUCS), 2010 4th International
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-7821-7
  • Type

    conf

  • DOI
    10.1109/IUCS.2010.5666008
  • Filename
    5666008