DocumentCode
584741
Title
Question classification in Persian language based on conditional random fields
Author
Mollaei, Ali ; Rahati-Quchani, Saeed ; Estaji, Azam
Author_Institution
Islamic Azad Univ., Mashhad, Iran
fYear
2012
fDate
18-19 Oct. 2012
Firstpage
295
Lastpage
300
Abstract
The question classification system is one of the important subsystems in the Question Answering Systems (QAS). In such systems through retrieval methods and information extraction the texts are retrieved in order to get to a correct answer. The current study is designed to present the architecture of question classification (QC) in Persian based on the Conditional Random Fields (CRF) machine learning model and evaluate effects of various features on its accuracy. In this study, sentences were classified into two levels of coarse and fine classes based on the type of the answer to each question. After extracting features and setting sliding window on the CRF model, CRF question classifier (QC) is train. Then, the QC predicts labels for every token in question. Next, a majority voting on the question classification output, is used to extract a unique label for each question. Further, the effects of different features on the ultimate accuracy of the system were evaluated. Finally results of this question classifier, illustrate a satisfactory accuracy.
Keywords
learning (artificial intelligence); natural language processing; pattern classification; question answering (information retrieval); CRF machine learning model; Persian language; QAS; coarse sentence; conditional random field; fine sentence; information extraction; majority voting; question answering system; question classification; retrieval method; Accuracy; Cities and towns; Feature extraction; Hidden Markov models; Machine learning; Semantics; Training; conditional random fields; majority voting; question answering system; question classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Knowledge Engineering (ICCKE), 2012 2nd International eConference on
Conference_Location
Mashhad
Print_ISBN
978-1-4673-4475-3
Type
conf
DOI
10.1109/ICCKE.2012.6395395
Filename
6395395
Link To Document