DocumentCode :
152471
Title :
A novel similarity algorithm for fixing erroneous turkish text and detection of roots
Author :
Ozdemir, C. ; Atas, M.
Author_Institution :
M.Y.O. Bilgisayar Programciligi Bolumu, Siirt Univ. Siirt, Siirt, Turkey
fYear :
2014
fDate :
23-25 April 2014
Firstpage :
830
Lastpage :
833
Abstract :
Finding roots of words is widely used in document classification and text mining. Computational methods of text similarity are intensely utilized on the English words and successful outcomes are obtained. On the other hand, applying the aforementioned methods on the Turkish words did not give the similar success. In this study, a novel similarity computation algorithm is developed. By using this algorithm it is aimed to find correct words or advice possible alternatives from the written erroneous Turkish words as a highest accuracy rate.
Keywords :
pattern classification; text analysis; English words; Turkish words; document classification; erroneous Turkish text; roots detection; similarity computation algorithm; text mining; text similarity algorithm; Accuracy; Classification algorithms; Conferences; Noise; Signal processing algorithms; Text mining; Natural language processing; correcting erroneous words; edit distance; root finding; word similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2014 22nd
Conference_Location :
Trabzon
Type :
conf
DOI :
10.1109/SIU.2014.6830358
Filename :
6830358
Link To Document :
بازگشت