Title :
Mongolian information retrieval method based on LDA model
Author :
Min Lin Siriguleng;Changbo Tian
Author_Institution :
College of Computer and Information Engineering, Inner Mongolia Normal University, Hohhot, Inner Mongolia 010022, China
Abstract :
A new method based on Latent Dirichlet Allocation (LDA) is proposed for information retrieval in Mongolian. The method takes the semantic information of Mongolian documents into account when relating query keywords to the retrieved documents. It models Mongolian documents with LDA, estimates the parameters with Gibbs sampling, and represents the probability of each word. In this way it can mine the hidden relationships between topics and the words in the documents, obtain the topic distribution, and compute the topic similarity of the keywords. Finally, it returns the documents most relevant to the keywords by topic. Experimental results show that the method performs better on topic semantics than the vector space model and the language model.
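The pipeline the abstract outlines (fit LDA by Gibbs sampling, read off topic distributions, rank documents against query keywords) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `gibbs_lda`, `query_log_likelihood`, and `rank_documents`, the hyperparameters `alpha` and `beta`, and the use of integer word ids in place of Mongolian tokens are all assumptions made for illustration. The abstract does not specify the similarity measure, so a query log-likelihood under the topic model is used here as a stand-in.

```python
import math
import random


def gibbs_lda(docs, num_topics, vocab_size, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Collapsed Gibbs sampling for LDA. Each doc is a list of integer word ids.

    Returns (theta, phi): per-document topic distributions and
    per-topic word distributions. Hyperparameter values are assumptions.
    """
    rng = random.Random(seed)
    doc_topic = [[0] * num_topics for _ in docs]                # count of topic k in doc d
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # count of word w in topic k
    topic_total = [0] * num_topics                              # total words in topic k
    assign = []                                                 # topic of each token

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assign.append(zs)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assign[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Conditional probability of each topic, up to a constant.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assign[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1

    theta = [
        [(doc_topic[d][t] + alpha) / (len(doc) + num_topics * alpha)
         for t in range(num_topics)]
        for d, doc in enumerate(docs)
    ]
    phi = [
        [(topic_word[t][w] + beta) / (topic_total[t] + vocab_size * beta)
         for w in range(vocab_size)]
        for t in range(num_topics)
    ]
    return theta, phi


def query_log_likelihood(query, theta_d, phi):
    """log P(query | doc) with P(w | doc) = sum_k phi[k][w] * theta_d[k]."""
    return sum(
        math.log(sum(phi[k][w] * theta_d[k] for k in range(len(theta_d))))
        for w in query
    )


def rank_documents(query, theta, phi):
    """Return document indices ordered from most to least relevant."""
    scores = [query_log_likelihood(query, th, phi) for th in theta]
    return sorted(range(len(theta)), key=lambda d: scores[d], reverse=True)
```

A call such as `theta, phi = gibbs_lda(docs, num_topics=2, vocab_size=6)` followed by `rank_documents([0, 1], theta, phi)` returns the document indices in descending order of the stand-in score. A real system would first segment and stem Mongolian text into tokens, which this sketch leaves out.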
Keywords :
"Semantics","Information retrieval","Probability distribution","Computational modeling","Education","Resource management","Aerospace electronics"
Conference_Title :
Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
Print_ISBN :
978-1-4799-8352-0
Electronic_ISBN :
2327-0594
DOI :
10.1109/ICSESS.2015.7339073