Title :
Three term weighting and classification algorithms in text automatic classification
Author_Institution :
Shanghai Jiaotong Univ., China
Abstract :
Three automatic text classification algorithms are presented: a Bayes method based on Bayes' theorem and IDF (Inverse Document Frequency), a VSM (vector space model) based on Shannon entropy, and a fuzzy method based on fuzzy set theory. The paper also describes how term weighting methods are combined with the three classification algorithms.
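The abstract names IDF and Shannon entropy as term weighting schemes but does not give their formulas. As a rough illustration only, the sketch below uses the common textbook definitions, IDF as log(N / df) and an entropy weight of 1 - H(t) / log N. These definitions, the function names `idf_weights` and `entropy_weights`, and the token-list input format are all assumptions and may differ from what the paper actually uses.

```python
import math
from collections import Counter


def idf_weights(docs):
    """Textbook IDF (an assumption, not the paper's formula): log(N / df(t))."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    return {term: math.log(n / count) for term, count in df.items()}


def entropy_weights(docs):
    """Entropy-based weight (assumed form): 1 - H(t) / log(N).

    H(t) is the Shannon entropy of the term's frequency distribution
    over documents. A term spread evenly over all documents gets a
    weight near 0; a term confined to one document gets weight 1.
    """
    n = len(docs)
    per_doc = [Counter(doc) for doc in docs]
    totals = Counter()
    for counts in per_doc:
        totals.update(counts)

    weights = {}
    for term, total in totals.items():
        entropy = 0.0
        for counts in per_doc:
            if counts[term]:
                p = counts[term] / total
                entropy -= p * math.log(p)
        # With a single document the normaliser log(N) is 0, so fall back to 1.
        weights[term] = 1.0 - entropy / math.log(n) if n > 1 else 1.0
    return weights
```

Under either scheme, a term that occurs equally in every document carries no discriminating information and receives a weight of about zero, while a term found in only one document receives the highest weight.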
Keywords :
Bayes methods; classification; entropy; fuzzy set theory; text analysis; Bayes method; Bayes theorem; Inverse Document Frequency; Shannon entropy; VSM; automatic text classification algorithms; fuzzy method; fuzzy theory; term weighting algorithms
Conference_Title :
High Performance Computing in the Asia-Pacific Region, 2000. Proceedings. The Fourth International Conference/Exhibition on
Conference_Location :
Beijing, China
Print_ISBN :
0-7695-0589-2
DOI :
10.1109/HPC.2000.843510