Title :
Identification of opinions in Arabic newspapers
Author :
Lazhar, Farek ; Yamina, Tlili Guiassa
Author_Institution :
Comput. Sci. Dept., Univ. of Annaba, Sidi Amar, Algeria
Abstract :
Identification of opinions is a set of techniques which is a part of the natural language processing, especially in the information research area. This consists in developing systems able to extract and explore the opinions existing in corpuses. The presence of important textual mass of Arabic newspapers in an electronic format requires a particular exploration technique. We intend to present in this paper a system of opinions identification, based on the model of Aila Rosà, representing the opinion as an object composed of four elements : predicate, source, topic and content. Two properties: polarity and intensity which are inspired from the work of Plantié Mathieu and are added to this model to establish relationships between the different opinions present in the text according to their different degrees of intensity and polarity. In presenting its general architecture, our system uses several techniques such as: XML representation of opinions, semantic expansion of opinions as explained by Nicolas B and finally a statistical representation of the opinions in occurrences matrix format to facilitate the calculation of the similarity between the opinions in the classification phase.
Keywords :
electronic publishing; natural language processing; text analysis; Arabic newspapers; electronic newspaper; natural language processing; opinions identification; semantic expansion; Computational modeling; Encoding; Pragmatics; Semantics; Syntactics; XML; Arabic Language; Identification; Natural Language Processing; Newspapers; Opinions; Semantic Expansion;
Conference_Titel :
Machine and Web Intelligence (ICMWI), 2010 International Conference on
Conference_Location :
Algiers
Print_ISBN :
978-1-4244-8608-3
DOI :
10.1109/ICMWI.2010.5648141