DocumentCode
152471
Title
A novel similarity algorithm for fixing erroneous turkish text and detection of roots
Author
Ozdemir, C. ; Atas, M.
Author_Institution
M.Y.O. Bilgisayar Programciligi Bolumu, Siirt Univ. Siirt, Siirt, Turkey
fYear
2014
fDate
23-25 April 2014
Firstpage
830
Lastpage
833
Abstract
Finding roots of words is widely used in document classification and text mining. Computational methods of text similarity are intensely utilized on the English words and successful outcomes are obtained. On the other hand, applying the aforementioned methods on the Turkish words did not give the similar success. In this study, a novel similarity computation algorithm is developed. By using this algorithm it is aimed to find correct words or advice possible alternatives from the written erroneous Turkish words as a highest accuracy rate.
Keywords
pattern classification; text analysis; English words; Turkish words; document classification; erroneous Turkish text; roots detection; similarity computation algorithm; text mining; text similarity algorithm; Accuracy; Classification algorithms; Conferences; Noise; Signal processing algorithms; Text mining; Natural language processing; correcting erroneous words; edit distance; root finding; word similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Communications Applications Conference (SIU), 2014 22nd
Conference_Location
Trabzon
Type
conf
DOI
10.1109/SIU.2014.6830358
Filename
6830358
Link To Document