DocumentCode :
185577
Title :
Discrimination between Serbian and Slovenian language by texture analysis
Author :
Brodic, Darko ; Milivojevic, Zoran N. ; Maluckov, Cedomir A. ; Jevtic, Miroljub
Author_Institution :
Tech. Fac. in Bor, Univ. of Belgrade, Bor, Serbia
fYear :
2014
fDate :
26-30 May 2014
Firstpage :
1142
Lastpage :
1146
Abstract :
The paper proposed the method for the language identification according to texture analysis. First, the algorithm encrypted the text given in different languages to the cipher based on the baseline status of each script element in text. Then, the cipher was subjected to the texture analysis. The aim of this analysis was the extraction of texture features. Obtained texture features showed significant diversity between languages. Hence, they were suitable for creating the discrimination and identification criteria between languages. The proposed method was tested on Serbian and Slovenian documents from custom oriented database. The experiments gave encouraging results.
Keywords :
natural language processing; text analysis; Serbian language; Slovenian language; cipher; identification criteria; language identification; texture analysis; texture features; Ciphers; Databases; Educational institutions; Entropy; Feature extraction; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2014 37th International Convention on
Conference_Location :
Opatija
Print_ISBN :
978-953-233-081-6
Type :
conf
DOI :
10.1109/MIPRO.2014.6859740
Filename :
6859740
Link To Document :
بازگشت