Title :
Hierarchical approach for scientific document classification
Author :
D´cunha, Arlina ; Sen, A.K.
Author_Institution :
Comput. Dept., St. Francis Inst. of Technol., Mumbai, India
Abstract :
Classification is the grouping of information or objects in predefined labeled categories based on similarities. Exponential growth rates of scientific document collection leads to unmanageable manual classification. Feature extraction is the central prerequisite of automatic document classification. TF-IDF (term frequency-inverse document frequency) is commonly used to express the text feature weight. This paper proposes a new feature weighting method by modifying TF-IDF formula.
Keywords :
document handling; pattern classification; TF-IDF formula; feature extraction; hierarchical approach; scientific document classification; scientific document collection; term frequency-inverse document frequency; text feature weight; Artificial intelligence; Automation; Classification algorithms; Feature extraction; Mathematical model; Text categorization; Training; Classification; Scientific document; tf-idf;
Conference_Titel :
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-8889-1
DOI :
10.1109/CCAA.2015.7148351