• DocumentCode
    3026015
  • Title

    Hierarchical approach for scientific document classification

  • Author

    D´cunha, Arlina ; Sen, A.K.

  • Author_Institution
    Comput. Dept., St. Francis Inst. of Technol., Mumbai, India
  • fYear
    2015
  • fDate
    15-16 May 2015
  • Firstpage
    100
  • Lastpage
    104
  • Abstract
    Classification is the grouping of information or objects in predefined labeled categories based on similarities. Exponential growth rates of scientific document collection leads to unmanageable manual classification. Feature extraction is the central prerequisite of automatic document classification. TF-IDF (term frequency-inverse document frequency) is commonly used to express the text feature weight. This paper proposes a new feature weighting method by modifying TF-IDF formula.
  • Keywords
    document handling; pattern classification; TF-IDF formula; feature extraction; hierarchical approach; scientific document classification; scientific document collection; term frequency-inverse document frequency; text feature weight; Artificial intelligence; Automation; Classification algorithms; Feature extraction; Mathematical model; Text categorization; Training; Classification; Scientific document; tf-idf;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing, Communication & Automation (ICCCA), 2015 International Conference on
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-8889-1
  • Type

    conf

  • DOI
    10.1109/CCAA.2015.7148351
  • Filename
    7148351