DocumentCode
506574
Title
Kernel- based Chinese recognition with ontology
Author
Shuxia, Pang ; Rui, Li ; Zhanting, Yuan ; Qiuyu, Zhang
Author_Institution
Sch. of Comput. & Commun., Lanzhou Univ. of Technol., Lanzhou, China
Volume
1
fYear
2009
fDate
20-22 Nov. 2009
Firstpage
443
Lastpage
446
Abstract
Since the Chinese Websites have increased in the explosive Internet era, making efficient information retrieval systems has become one of the major endeavors, especially in fields of Chinese recognition. In this paper, the authors study the integration of subsequence kernel function based on ontology. Using the vector space model (VSM) to create subsequence kernels, the kernel methodology described here not only overcomes the VSM ignoring any semantic relation between words, but also results both in functional similarity and in sequence/words similarity by gap-weighted subsequences kernels, and the most important is that semantic character is also taken into account, which is very useful for Chinese recognition on Internet. Experiments show that the method has more exact retrieval results, and its cost is under the accepted tolerance.
Keywords
Internet; Web sites; character recognition; information retrieval systems; natural language processing; ontologies (artificial intelligence); Chinese Web sites; Internet; functional similarity; gap-weighted subsequences kernels; information retrieval systems; kernel-based Chinese character recognition; ontology; sequence similarity; subsequence kernel function; vector space model; words similarity; Character recognition; Costs; Data mining; Explosives; Humans; Information retrieval; Internet; Kernel; Ontologies; Text recognition; chinese recognition; kernel; ontology;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-4754-1
Electronic_ISBN
978-1-4244-4738-1
Type
conf
DOI
10.1109/ICICISYS.2009.5357807
Filename
5357807
Link To Document