• DocumentCode
    185577
  • Title

    Discrimination between Serbian and Slovenian language by texture analysis

  • Author

    Brodic, Darko ; Milivojevic, Zoran N. ; Maluckov, Cedomir A. ; Jevtic, Miroljub

  • Author_Institution
    Tech. Fac. in Bor, Univ. of Belgrade, Bor, Serbia
  • fYear
    2014
  • fDate
    26-30 May 2014
  • Firstpage
    1142
  • Lastpage
    1146
  • Abstract
    The paper proposed the method for the language identification according to texture analysis. First, the algorithm encrypted the text given in different languages to the cipher based on the baseline status of each script element in text. Then, the cipher was subjected to the texture analysis. The aim of this analysis was the extraction of texture features. Obtained texture features showed significant diversity between languages. Hence, they were suitable for creating the discrimination and identification criteria between languages. The proposed method was tested on Serbian and Slovenian documents from custom oriented database. The experiments gave encouraging results.
  • Keywords
    natural language processing; text analysis; Serbian language; Slovenian language; cipher; identification criteria; language identification; texture analysis; texture features; Ciphers; Databases; Educational institutions; Entropy; Feature extraction; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2014 37th International Convention on
  • Conference_Location
    Opatija
  • Print_ISBN
    978-953-233-081-6
  • Type

    conf

  • DOI
    10.1109/MIPRO.2014.6859740
  • Filename
    6859740