A novel similarity algorithm for fixing erroneous turkish text and detection of roots

Author

Ozdemir, C. ; Atas, M.

Author_Institution

M.Y.O. Bilgisayar Programciligi Bolumu, Siirt Univ. Siirt, Siirt, Turkey

fYear

2014

fDate

23-25 April 2014

Firstpage

830

Lastpage

833

Abstract

Finding roots of words is widely used in document classification and text mining. Computational methods of text similarity are intensely utilized on the English words and successful outcomes are obtained. On the other hand, applying the aforementioned methods on the Turkish words did not give the similar success. In this study, a novel similarity computation algorithm is developed. By using this algorithm it is aimed to find correct words or advice possible alternatives from the written erroneous Turkish words as a highest accuracy rate.

Keywords

pattern classification; text analysis; English words; Turkish words; document classification; erroneous Turkish text; roots detection; similarity computation algorithm; text mining; text similarity algorithm; Accuracy; Classification algorithms; Conferences; Noise; Signal processing algorithms; Text mining; Natural language processing; correcting erroneous words; edit distance; root finding; word similarity;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing and Communications Applications Conference (SIU), 2014 22nd

Conference_Location

Trabzon

Type

conf

DOI

10.1109/SIU.2014.6830358

Filename

6830358

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=152471