Title :
Latent Dirichlet Allocation based multilevel classification
Author :
Bhutada, Sunil ; Balaram, V.V.S.S.S. ; Bulusu, Vishnu Vardhan
Author_Institution :
Dept. of IT, SNIST, Hyderabad, India
Abstract :
Information processing and knowledge extraction are the two key factors for mining technique. Many models were proposed and implemented successfully on the available information over internet. Automatic Categorization is a machine learning approach which is important for the information processing. In this paper an attempt is made to propose a multilevel classification model using Latent Dirichlet Allocation (LDA) approach. Though the existence of Latent Dirichlet Allocation (LDA) is observed in the literature but a modified model for multilevel classification is presented which is independent of any language. In order to achieve such model many existing proposals were considered like PLSI, which uses the Exceptional Maximization (EM) method only to train the latent classes. The iterative process of Latent Dirichlet Allocation (LDA), which yields the multilevel classification of the corpus. Topic Modeling is used to discover the hidden things that pervade the collection to annotate the documents according to the new topics.
Keywords :
data mining; iterative methods; learning (artificial intelligence); pattern classification; statistical analysis; EM method; automatic categorization; exceptional maximization method; information processing; iterative process; knowledge extraction; latent Dirichlet allocation; machine learning approach; mining technique; multilevel classification; topic modeling; Computational modeling; Instruments; Internet; Large scale integration; Probabilistic logic; Resource management; Semantics; Latent Dirichlet Allocation; PLSI; Topic Modeling; Vector Space Model;
Conference_Titel :
Control, Instrumentation, Communication and Computational Technologies (ICCICCT), 2014 International Conference on
Conference_Location :
Kanyakumari
Print_ISBN :
978-1-4799-4191-9
DOI :
10.1109/ICCICCT.2014.6993109