• DocumentCode
    152471
  • Title

    A novel similarity algorithm for fixing erroneous turkish text and detection of roots

  • Author

    Ozdemir, C. ; Atas, M.

  • Author_Institution
    M.Y.O. Bilgisayar Programciligi Bolumu, Siirt Univ. Siirt, Siirt, Turkey
  • fYear
    2014
  • fDate
    23-25 April 2014
  • Firstpage
    830
  • Lastpage
    833
  • Abstract
    Finding roots of words is widely used in document classification and text mining. Computational methods of text similarity are intensely utilized on the English words and successful outcomes are obtained. On the other hand, applying the aforementioned methods on the Turkish words did not give the similar success. In this study, a novel similarity computation algorithm is developed. By using this algorithm it is aimed to find correct words or advice possible alternatives from the written erroneous Turkish words as a highest accuracy rate.
  • Keywords
    pattern classification; text analysis; English words; Turkish words; document classification; erroneous Turkish text; roots detection; similarity computation algorithm; text mining; text similarity algorithm; Accuracy; Classification algorithms; Conferences; Noise; Signal processing algorithms; Text mining; Natural language processing; correcting erroneous words; edit distance; root finding; word similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference (SIU), 2014 22nd
  • Conference_Location
    Trabzon
  • Type

    conf

  • DOI
    10.1109/SIU.2014.6830358
  • Filename
    6830358