Title :
A novel similarity algorithm for fixing erroneous turkish text and detection of roots
Author :
Ozdemir, C. ; Atas, M.
Author_Institution :
M.Y.O. Bilgisayar Programciligi Bolumu, Siirt Univ. Siirt, Siirt, Turkey
Abstract :
Finding roots of words is widely used in document classification and text mining. Computational methods of text similarity are intensely utilized on the English words and successful outcomes are obtained. On the other hand, applying the aforementioned methods on the Turkish words did not give the similar success. In this study, a novel similarity computation algorithm is developed. By using this algorithm it is aimed to find correct words or advice possible alternatives from the written erroneous Turkish words as a highest accuracy rate.
Keywords :
pattern classification; text analysis; English words; Turkish words; document classification; erroneous Turkish text; roots detection; similarity computation algorithm; text mining; text similarity algorithm; Accuracy; Classification algorithms; Conferences; Noise; Signal processing algorithms; Text mining; Natural language processing; correcting erroneous words; edit distance; root finding; word similarity;
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2014 22nd
Conference_Location :
Trabzon
DOI :
10.1109/SIU.2014.6830358