Title :
Efficiently denoising SMS text for FAQ retrieval
Author :
Batra, Ruhi ; Sharma, Shantanu ; Shrivastav, Anurag ; Goyal, Puneet
Author_Institution :
Dept. of Comput. Sci. & Eng., Graphic Era Univ., Dehradun, India
Abstract :
Several online resources in the form of Frequently Asked Questions (FAQs) provide useful and much needed information across different domains like health, education, banking etc. Community based service has emerged as a powerful resource for information retrieval but its access is restricted to internet only.To access information without internet facility more and more people rely on Short Message Service (SMS) to get an instant answer to their query.Therefore, efforts have been put to improve the SMS based information retrieval system. The text in SMS messages are generally noisy and correcting this noisy text is one of the major challenges that affect the efficiency and accuracy of any SMS based information retrieval system. This paper provides the improvements to the existing algorithms of noise removal in SMS text to obtain better results. Experiments using different test cases show that the proposed system outperforms other methods.
Keywords :
electronic messaging; query processing; text analysis; FAQ retrieval; SMS based information retrieval system; SMS messages; SMS text denoising; frequently asked questions; information access; noise removal; short message service; Databases; Dictionaries; Noise; Noise measurement; Noise reduction; Servers; Algorithms; Information retrieval; Noise removal; Short Message Service (SMS); Similarity measure;
Conference_Titel :
Data Mining and Intelligent Computing (ICDMIC), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-4675-4
DOI :
10.1109/ICDMIC.2014.6954237