DocumentCode :
3342481
Title :
Intertextual distance for Arabic texts classification
Author :
Ayadi, R. ; Maraoui, M. ; Zrigui, M.
Author_Institution :
UTIC Lab., ISIM-Sfax Inst., Sfax, Tunisia
fYear :
2009
fDate :
9-12 Nov. 2009
Firstpage :
1
Lastpage :
6
Abstract :
Our research applies intertextual distance theory to the Arabic language as a tool for text classification. This theory classifies texts according to criteria of lexical statistics and is based on the lexical connection approach. Our objective is to integrate this theory as a classification tool for Arabic texts. Doing so requires a metric for text classification that draws on a database of lemmatized and identified corpora, which serves as a literary reference for periods, genres, literary themes, and authors, so that anonymous texts can be classified.
Keywords :
natural languages; pattern classification; statistical analysis; text analysis; Arabic texts classification; anonymous text classification; intertextual distance theory; lexical connection approach; lexical statistics; Character recognition; Databases; Fusion power generation; Indexing; Laboratories; Merging; Statistics; Text categorization; Text recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Internet Technology and Secured Transactions, 2009. ICITST 2009. International Conference for
Conference_Location :
London
Print_ISBN :
978-1-4244-5647-5
Type :
conf
DOI :
10.1109/ICITST.2009.5402564
Filename :
5402564