DocumentCode :
586720
Title :
An error probability estimation of the document classification using Markov model
Author :
Kobayashi, Masato ; Ninomiya, Hiroshi ; Matsushima, Takaaki ; Hirasawa, Shoichi
Author_Institution :
Shonan Inst. of Technol., Fujisawa, Japan
fYear :
2012
fDate :
28-31 Oct. 2012
Firstpage :
717
Lastpage :
721
Abstract :
The document classification problem has been investigated by various techniques, such as a vector space model, a support vector machine, a random forest, and so on. On the other hand, J. Ziv et al. have proposed a document classification method using Ziv-Lempel algorithm to compress the data. Furthermore, the Context-Tree Weighting (CTW) algorithm has been proposed as an outstanding data compression, and for the document classification using the CTW algorithm experimental results have been reported. In this paper, we assume that each document with same category arises from Markov model with same parameters for the document classification. Then we propose an analysis method to estimate a classification error probability for the document with the finite length.
Keywords :
Markov processes; data compression; error detection; Markov model; Ziv-Lempel algorithm; classification error probability; context-tree weighting algorithm; document classification; error probability estimation; finite length; outstanding data compression; random forest; support vector machine; vector space model; Approximation methods; Context; Context modeling; Error probability; Estimation; Information theory; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory and its Applications (ISITA), 2012 International Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4673-2521-9
Type :
conf
Filename :
6401034
Link To Document :
بازگشت