• DocumentCode
    596127
  • Title

    Discovering Domain Concepts and Hyponymy Relations by Text Relevance Classifying Based Iterative Web Searching

  • Author

    Lili Mou ; Ge Li ; Zhi Jin ; Yangyang Lu ; Yiyang Hao

  • Author_Institution
    Software Inst., Peking Univ., Beijing, China
  • Volume
    1
  • fYear
    2012
  • fDate
    4-7 Dec. 2012
  • Firstpage
    213
  • Lastpage
    222
  • Abstract
    Domain concepts and taxonomic relationships are an essential part of a domain ontology. They are used in a number of applications, including natural language processing, information retrieval, knowledge management and so on. Nowadays, with the continuous permeation of various kinds of Internet knowledge applications, numerous new concepts are emerged and released on to the Internet. So, the Internet has become an invaluable source of new concepts for almost every possible domain of knowledge. In order to ensure the domain ontologies keep pace with fast changing knowledge, we proposed an web searching based concepts and taxonomic relationships discovering approach. By our approach, the potential concepts on the Internet, which are taxonomically related with the give seeds concepts, can be discovered autonomously and iteratively. In this paper, the approach and a corresponding application in Chinese web pages are reported in detail. The experiments show that, our approach can catch the related domain concepts precisely, meanwhile, can reject irrelevant concepts and figure out the domain knowledge border definitely.
  • Keywords
    Internet; Web sites; information retrieval; knowledge management; natural language processing; ontologies (artificial intelligence); pattern classification; search engines; text analysis; Chinese Web page; Internet knowledge application; Web searching based concept; domain concept; domain ontology; hyponymy relation; information retrieval; iterative Web searching; knowledge management; natural language processing; taxonomic relationships discovering approach; text relevance classification; Biology; Chemicals; Computer languages; Internet; Ontologies; Web pages; domain knowledge; ontology learning; taxonomy learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering Conference (APSEC), 2012 19th Asia-Pacific
  • Conference_Location
    Hong Kong
  • ISSN
    1530-1362
  • Print_ISBN
    978-1-4673-4930-7
  • Type

    conf

  • DOI
    10.1109/APSEC.2012.96
  • Filename
    6462656