Title :
Phonetic matching and syntactic tree similarity based QA system for SMS queries
Author :
Mittal, Anish ; Bhatt, Piyush ; Kumar, Pranaw
Author_Institution :
Dept. of Comput. Sci. & Eng., Graphic Era Univ., Dehradun, India
Abstract :
Many QA systems for SMS queries operate over large archives, but none addresses the phonetic noise that is commonly present in SMS queries. Finding similar questions in the QA archive is not trivial, especially in the presence of phonetic noise. This paper proposes a solution that handles noise, including phonetic noise. We present a technique to handle semantic variation by developing a new phonetic algorithm that uses the Soundex and Metaphone algorithms. In addition, we modify the Longest Common Subsequence problem to raise the level of similarity between noisy SMS words and the corresponding dictionary words. With this approach, the system also handles syntactic variation in question formulation without any SMS normalization.
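The abstract does not give the details of the new phonetic algorithm or of the modified Longest Common Subsequence, so the following is only a minimal sketch of the general idea, assuming standard American Soundex and the textbook LCS length. The function names, the equal weighting of the two signals, and the sample dictionary are all hypothetical, and Metaphone is left out for brevity.

```python
def soundex(word: str) -> str:
    """Standard American Soundex code (one letter plus three digits)."""
    codes = {}
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in letters:
            codes[ch] = digit
    word = "".join(ch for ch in word.lower() if ch.isalpha())
    if not word:
        return ""
    result = word[0].upper()
    prev = codes.get(word[0], "")
    for ch in word[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = codes.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code is kept
    return (result + "000")[:4]


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence (plain dynamic programming)."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, 1):
            if ca == cb:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(noisy: str, candidate: str) -> float:
    """Hypothetical score: half phonetic agreement, half LCS ratio."""
    if not noisy or not candidate:
        return 0.0
    phonetic = 1.0 if soundex(noisy) == soundex(candidate) else 0.0
    lcs_ratio = lcs_length(noisy.lower(), candidate.lower()) / max(len(noisy), len(candidate))
    return 0.5 * phonetic + 0.5 * lcs_ratio  # equal weights are an assumption


def best_match(noisy: str, dictionary: list[str]) -> str:
    """Return the dictionary word with the highest similarity score."""
    return max(dictionary, key=lambda w: similarity(noisy, w))


if __name__ == "__main__":
    words = ["tomato", "tomorrow", "today"]  # illustrative dictionary
    print(best_match("tomoro", words))
```

In this sketch the phonetic code catches spellings that sound alike, while the LCS ratio rewards candidates that preserve the letter order of the noisy token; the paper's own algorithm may combine these signals differently.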
Keywords :
dictionaries; electronic messaging; query processing; question answering (information retrieval); speech processing; trees (mathematics); Metaphone algorithm; QA archive; SMS queries; Soundex algorithm; dictionary words; longest common subsequence problem; phonetic matching; phonetic noise; question formulation; short message service; syntactic tree similarity based QA system; Accuracy; Dictionaries; Kernel; Noise; Noise measurement; Semantics; Syntactics; Noise Handling; SMS queries; Similarity Score; noisy text;
Conference_Title :
Green Computing Communication and Electrical Engineering (ICGCCEE), 2014 International Conference on
Conference_Location :
Coimbatore
DOI :
10.1109/ICGCCEE.2014.6921412