Title :
Automatic standardization of spelling variations of Hindi text
Author :
Goyal, Vishal ; Lehal, Gurpreet Singh
Author_Institution :
Dept. of Comput. Sci., Punjabi Univ., Patiala, India
Abstract :
The phonetic nature of Indian languages and multiple dialects, transliteration of proper names, words borrowed from foreign languages has resulted in spelling variations of the same word. Such variations sometimes can be treated as errors in writing. While developing machine translation system, the task of standardizing the spellings for further processing the text is considered to play vital role in improving the accuracy of translation. In this paper, the rule based approach for standardizing spelling variations in the Hindi text while developing Hindi to Punjabi Machine Translation System has been explained. It was analyzed that only 7.45% text was standardized using this approach and thus had increased the accuracy of the machine translation system.
Keywords :
language translation; natural language processing; text analysis; Hindi text; Hindi-Punjabi machine translation system; Indian languages; rule based approach; spelling variation standardization; Accuracy; Databases; Dictionaries; Knowledge based systems; Speech; Speech recognition; Standardization; Machine Translation; Natural Language Processing; Preprocessing Module; Standardizing spelling variations; Text Normalization;
Conference_Titel :
Computer and Communication Technology (ICCCT), 2010 International Conference on
Conference_Location :
Allahabad, Uttar Pradesh
Print_ISBN :
978-1-4244-9033-2
DOI :
10.1109/ICCCT.2010.5640441