Title :
Document classification method with small training data
Author :
Maeda, Yasunari ; Yoshida, Hideki ; Matsushima, Toshiyasu
Author_Institution :
Dept. of Comput. Sci., Kitami Inst. of Technol., Hokkaido, Japan
Abstract :
Document classification is one of important topics in the field of NLP(Natural Language Processing). In our previous research we´ve proposed a document classification method which minimizes an error rate with reference to a Bayes criterion. But when the number of documents in training data is small, the accuracy of the previous method is low. So in this research we propose a document classification method whose accuracy is higher than the previous method when the number of documents in training data is small.
Keywords :
Bayes methods; classification; document handling; learning (artificial intelligence); natural language processing; Bayes criterion; document classification method; error rate minimization; natural language processing; small training data; Electronic mail; Error analysis; Mathematics; Probability distribution; Training data; document classification; estimating data; prior distributions; small training data;
Conference_Titel :
ICCAS-SICE, 2009
Conference_Location :
Fukuoka
Print_ISBN :
978-4-907764-34-0
Electronic_ISBN :
978-4-907764-33-3