DocumentCode :
3026015
Title :
Hierarchical approach for scientific document classification
Author :
D´cunha, Arlina ; Sen, A.K.
Author_Institution :
Comput. Dept., St. Francis Inst. of Technol., Mumbai, India
fYear :
2015
fDate :
15-16 May 2015
Firstpage :
100
Lastpage :
104
Abstract :
Classification is the grouping of information or objects in predefined labeled categories based on similarities. Exponential growth rates of scientific document collection leads to unmanageable manual classification. Feature extraction is the central prerequisite of automatic document classification. TF-IDF (term frequency-inverse document frequency) is commonly used to express the text feature weight. This paper proposes a new feature weighting method by modifying TF-IDF formula.
Keywords :
document handling; pattern classification; TF-IDF formula; feature extraction; hierarchical approach; scientific document classification; scientific document collection; term frequency-inverse document frequency; text feature weight; Artificial intelligence; Automation; Classification algorithms; Feature extraction; Mathematical model; Text categorization; Training; Classification; Scientific document; tf-idf;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-8889-1
Type :
conf
DOI :
10.1109/CCAA.2015.7148351
Filename :
7148351
Link To Document :
بازگشت