DocumentCode
3026015
Title
Hierarchical approach for scientific document classification
Author
D´cunha, Arlina ; Sen, A.K.
Author_Institution
Comput. Dept., St. Francis Inst. of Technol., Mumbai, India
fYear
2015
fDate
15-16 May 2015
Firstpage
100
Lastpage
104
Abstract
Classification is the grouping of information or objects in predefined labeled categories based on similarities. Exponential growth rates of scientific document collection leads to unmanageable manual classification. Feature extraction is the central prerequisite of automatic document classification. TF-IDF (term frequency-inverse document frequency) is commonly used to express the text feature weight. This paper proposes a new feature weighting method by modifying TF-IDF formula.
Keywords
document handling; pattern classification; TF-IDF formula; feature extraction; hierarchical approach; scientific document classification; scientific document collection; term frequency-inverse document frequency; text feature weight; Artificial intelligence; Automation; Classification algorithms; Feature extraction; Mathematical model; Text categorization; Training; Classification; Scientific document; tf-idf;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location
Noida
Print_ISBN
978-1-4799-8889-1
Type
conf
DOI
10.1109/CCAA.2015.7148351
Filename
7148351
Link To Document