• DocumentCode
    3589495
  • Title

    Research on energy-efficient text classification

  • Author

    Hao Lin

  • Author_Institution
    Sch. of Software, Shanghai Jiao Tong Univ., Shanghai, China
  • fYear
    2014
  • Firstpage
    257
  • Lastpage
    261
  • Abstract
    People rely on data mining techniques like text categorization more and more to explore valuable information, due to the ever-increasing electronic documents produced. Although the energy consumed by text categorization increases with the data, people usually pay attention to its effectiveness and there is little research about its energy cost. In this paper, we evaluate the energy cost of different classifiers and reduce energy cost by parallelization, trying to find a classifier that performs best on both aspects - effectiveness and efficiency. Several classifiers are obtained by using existing libraries or implementing classification algorithms. Comprehensive experiments on three real datasets show that an improved version of Naive Bayes can have competitive precision compared to SVM while has low energy costs. Parallelization can further reduce its energy cost by a factor of 10 for RCV1 dataset.
  • Keywords
    energy conservation; pattern classification; text analysis; Naive Bayes; RCV1 dataset; SVM; data mining techniques; energy cost; energy-efficient text classification; Computers; Energy measurement; Estimation; Libraries; Support vector machines; Text categorization; Training; Classification algorithms; Green computing; Parallel processing; Performance evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology and Electronic Commerce (ICITEC), 2014 2nd International Conference on
  • Print_ISBN
    978-1-4799-5298-4
  • Type

    conf

  • DOI
    10.1109/ICITEC.2014.7105614
  • Filename
    7105614