• DocumentCode
    506574
  • Title

    Kernel- based Chinese recognition with ontology

  • Author

    Shuxia, Pang ; Rui, Li ; Zhanting, Yuan ; Qiuyu, Zhang

  • Author_Institution
    Sch. of Comput. & Commun., Lanzhou Univ. of Technol., Lanzhou, China
  • Volume
    1
  • fYear
    2009
  • fDate
    20-22 Nov. 2009
  • Firstpage
    443
  • Lastpage
    446
  • Abstract
    Since the Chinese Websites have increased in the explosive Internet era, making efficient information retrieval systems has become one of the major endeavors, especially in fields of Chinese recognition. In this paper, the authors study the integration of subsequence kernel function based on ontology. Using the vector space model (VSM) to create subsequence kernels, the kernel methodology described here not only overcomes the VSM ignoring any semantic relation between words, but also results both in functional similarity and in sequence/words similarity by gap-weighted subsequences kernels, and the most important is that semantic character is also taken into account, which is very useful for Chinese recognition on Internet. Experiments show that the method has more exact retrieval results, and its cost is under the accepted tolerance.
  • Keywords
    Internet; Web sites; character recognition; information retrieval systems; natural language processing; ontologies (artificial intelligence); Chinese Web sites; Internet; functional similarity; gap-weighted subsequences kernels; information retrieval systems; kernel-based Chinese character recognition; ontology; sequence similarity; subsequence kernel function; vector space model; words similarity; Character recognition; Costs; Data mining; Explosives; Humans; Information retrieval; Internet; Kernel; Ontologies; Text recognition; chinese recognition; kernel; ontology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-4754-1
  • Electronic_ISBN
    978-1-4244-4738-1
  • Type

    conf

  • DOI
    10.1109/ICICISYS.2009.5357807
  • Filename
    5357807