DocumentCode
2294949
Title
Restricted-domain Chinese automatic question-answering system based on question sentence similarity
Author
Zheng-Tao Yu ; Xiao-Zhong Fan ; Peng-Cheng Ji
Author_Institution
Dept. of Comput. Sci. & Eng., Beijing Inst. of Technol., China
Volume
5
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
3023
Abstract
It is an available pattern to implement auto answer in RDAQAS (restricted-domain automatic question-answering system) through calculating the similarity of target question sentences and question sentences in question sentences corpus, and then finding the most similar question sentences, and retrieving the answer finally and all these are based on HowNet and domain ontology. This paper introduces the building of financial domain ontology and question sentences corpus, then proposes the method to calculate similarity of question sentences based on keyword vector space method and semantic concept vector space method. The procedure of realization is described in details. The learning algorithm and learning course of getting question sentence semantic vectors based on the maximum entropy model are also introduced in detail. Finally, the experimental comparing data illustrates that the similarity calculation method based on the semantic concept is more superior to that based on the keyword.
Keywords
financial data processing; learning (artificial intelligence); maximum entropy methods; ontologies (artificial intelligence); HowNet knowledge database; financial domain ontology; keyword vector space method; learning algorithm; maximum entropy model; question sentence similarity; question sentences corpus; restricted domain Chinese automatic question answering system; semantic concept vector space method; Automation; Computer science; Content based retrieval; Data mining; Databases; Dictionaries; Entropy; Finance; Information retrieval; Ontologies;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1378551
Filename
1378551
Link To Document