DocumentCode :
1723280
Title :
News topics categorization using latent Dirichlet allocation and sparse representation classifier
Author :
Yuan-Shan Lee ; Lo, Rocky ; Chia-Yen Chen ; Po-Chuan Lin ; Jia-Ching Wang
fYear :
2015
Firstpage :
136
Lastpage :
137
Abstract :
Recently, subscribing news from websites has become a new trend for many Internet users. In a news reading browser, it is essential all the news documents are properly categorized. For automatically categorizing the news topics, this paper presents a news categorization method using latent Dirichlet allocation (LDA) and sparse representation classifier (SRC). In our work, the LDA is used as the feature learning method. The multinomial distribution of the news topics is regarded as the feature of the document. These features are stacked as an over-complete dictionary, permitting us to perform SRC-based categorization. The experimental results show that the proposed method outperforms the traditional method.
Keywords :
Internet; Web sites; learning (artificial intelligence); pattern classification; Internet; LDA; SRC-based categorization; Web sites; feature learning method; latent Dirichlet allocation; news categorization method; news topics multinomial distribution; over-complete dictionary; sparse representation classifier; topics categorization method; Computer science; Dictionaries; Resource management; Support vector machines; Testing; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Electronics - Taiwan (ICCE-TW), 2015 IEEE International Conference on
Conference_Location :
Taipei
Type :
conf
DOI :
10.1109/ICCE-TW.2015.7216819
Filename :
7216819
Link To Document :
بازگشت