DocumentCode
185577
Title
Discrimination between Serbian and Slovenian language by texture analysis
Author
Brodic, Darko ; Milivojevic, Zoran N. ; Maluckov, Cedomir A. ; Jevtic, Miroljub
Author_Institution
Tech. Fac. in Bor, Univ. of Belgrade, Bor, Serbia
fYear
2014
fDate
26-30 May 2014
Firstpage
1142
Lastpage
1146
Abstract
The paper proposed a method for language identification based on texture analysis. First, the algorithm encrypted text given in different languages into a cipher based on the baseline status of each script element in the text. Then, the cipher was subjected to texture analysis in order to extract texture features. The extracted features differed significantly between languages, which made them suitable for establishing discrimination and identification criteria between languages. The proposed method was tested on Serbian and Slovenian documents from a custom-oriented database. The experiments gave encouraging results.
Keywords
natural language processing; text analysis; Serbian language; Slovenian language; cipher; identification criteria; language identification; texture analysis; texture features; Ciphers; Databases; Educational institutions; Entropy; Feature extraction; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2014 37th International Convention on
Conference_Location
Opatija
Print_ISBN
978-953-233-081-6
Type
conf
DOI
10.1109/MIPRO.2014.6859740
Filename
6859740