DocumentCode
116090
Title
Phonetic matching and syntactic tree similarity based QA system for SMS queries
Author
Mittal, Anish ; Bhatt, Piyush ; Kumar, Pranaw
Author_Institution
Dept. of Comput. Sci. & Eng., Graphic Era Univ., Dehradun, India
fYear
2014
fDate
6-8 March 2014
Firstpage
1
Lastpage
6
Abstract
Many QA systems for SMS queries over large archives exist, but none addresses the phonetic noise most commonly present in SMS queries. Finding similar questions in a QA archive is not trivial, especially in the presence of phonetic noise. This paper proposes a solution that handles noise, including phonetic noise. We present a technique to handle semantic variation by developing a new phonetic algorithm that uses the Soundex and Metaphone algorithms. In addition, we modify the Longest Common Subsequence problem to raise the level of similarity between noisy SMS words and the corresponding dictionary words. The approach also handles syntactic variation in question formulation without any SMS normalization.
Keywords
dictionaries; electronic messaging; query processing; question answering (information retrieval); speech processing; trees (mathematics); Metaphone algorithm; QA archive; SMS queries; Soundex algorithm; dictionary words; longest common subsequence problem; phonetic matching; phonetic noise; question formulation; short message service; syntactic tree similarity based QA system; Accuracy; Dictionaries; Kernel; Noise; Noise measurement; Semantics; Syntactics; Noise Handling; SMS queries; Similarity Score; noisy text;
fLanguage
English
Publisher
IEEE
Conference_Titel
Green Computing Communication and Electrical Engineering (ICGCCEE), 2014 International Conference on
Conference_Location
Coimbatore
Type
conf
DOI
10.1109/ICGCCEE.2014.6921412
Filename
6921412