• DocumentCode
    1723280
  • Title

    News topics categorization using latent Dirichlet allocation and sparse representation classifier

  • Author

    Yuan-Shan Lee ; Lo, Rocky ; Chia-Yen Chen ; Po-Chuan Lin ; Jia-Ching Wang

  • fYear
    2015
  • Firstpage
    136
  • Lastpage
    137
  • Abstract
    Recently, subscribing news from websites has become a new trend for many Internet users. In a news reading browser, it is essential all the news documents are properly categorized. For automatically categorizing the news topics, this paper presents a news categorization method using latent Dirichlet allocation (LDA) and sparse representation classifier (SRC). In our work, the LDA is used as the feature learning method. The multinomial distribution of the news topics is regarded as the feature of the document. These features are stacked as an over-complete dictionary, permitting us to perform SRC-based categorization. The experimental results show that the proposed method outperforms the traditional method.
  • Keywords
    Internet; Web sites; learning (artificial intelligence); pattern classification; Internet; LDA; SRC-based categorization; Web sites; feature learning method; latent Dirichlet allocation; news categorization method; news topics multinomial distribution; over-complete dictionary; sparse representation classifier; topics categorization method; Computer science; Dictionaries; Resource management; Support vector machines; Testing; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Consumer Electronics - Taiwan (ICCE-TW), 2015 IEEE International Conference on
  • Conference_Location
    Taipei
  • Type

    conf

  • DOI
    10.1109/ICCE-TW.2015.7216819
  • Filename
    7216819