DocumentCode
2439949
Title
Research on Multi-classification and Multi-label in Text Categorization
Author
Hua, Liu
Author_Institution
Coll. of Chinese Language & Culture, Jinan Univ., Guangzhou, China
Volume
2
fYear
2009
fDate
26-27 Aug. 2009
Firstpage
86
Lastpage
89
Abstract
Aiming at multi-classification and multi-label in text categorization, an apery algorithm is proposed which judges whether document has multi-classification and multi-label by estimating the similarity difference among final classifier values. If the quotient of the biggest category´s classifier value divided by the second biggest category´s classifier value is less than or equal to a threshold, the document belongs to two categories. The optimum threshold is set to 1.4 by experiment, and experiment results demonstrate performance increases by 1.42 percent.
Keywords
data mining; learning (artificial intelligence); pattern classification; text analysis; apery algorithm; final classifier values; machine learning; multiclassification problem; multilabel problem; optimum threshold; text categorization; Cybernetics; Data mining; Educational institutions; Electronic mail; Humans; Intelligent systems; Machine learning; Man machine systems; Natural languages; Text categorization; multi-classification and multi-label; text categorization; threshold;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Human-Machine Systems and Cybernetics, 2009. IHMSC '09. International Conference on
Conference_Location
Hangzhou, Zhejiang
Print_ISBN
978-0-7695-3752-8
Type
conf
DOI
10.1109/IHMSC.2009.147
Filename
5336038
Link To Document