DocumentCode
3706670
Title
Did You Know? A Rule-Based Approach to Finding Similar Questions on Online Health Forums
Author
Jianglei Han;Naveen Nandan;Aixin Sun
Author_Institution
SAP Res. &
fYear
2015
Firstpage
513
Lastpage
514
Abstract
This paper describes our system submitted for the ICHI 2015 Healthcare Data Analytics Challenge. Given a relatively large corpus of questions posted by users on online health forums, for a newly posted question (i.e., Query question), our task is to find three most similar questions from the corpus. Our system employs Elastic search, a search server based on Lucene, at its core. The corpus of existing questions is indexed with n-grams. To search for most similar questions, the query question is re-written to a keyword-based query based on rules by considering multiple text components including title, key phrases, and noun phrases extracted from the question content.
Keywords
"Indexing","Medical services","Tagging","Boosting","Technological innovation","Poles and towers"
Publisher
ieee
Conference_Titel
Healthcare Informatics (ICHI), 2015 International Conference on
Type
conf
DOI
10.1109/ICHI.2015.94
Filename
7349756
Link To Document