DocumentCode :
638345
Title :
A novel model for Document Representation
Author :
Mountassir, Asmaa
Author_Institution :
Al-Bironi Res. Team, Mohamed 5 Univ., Rabat, Morocco
fYear :
2013
fDate :
27-30 May 2013
Firstpage :
1
Lastpage :
2
Abstract :
In this paper, we propose a novel model for Document Representation in an attempt to address the problem of huge dimensionality and vector sparseness that are commonly faced in Text Classification tasks. We conduct our experiments on data sets of Opinion Mining. We use as classifiers Support Vector Machines (SVM) and k-Nearest Neighbors (kNN). We compare the performance of our model with that of the classical representation based on Vector Space Model (VSM). Our experiments show that the effectiveness of our model depends on the used classifier. Results yielded by kNN when applying our model are the same as those obtained when applying the classical VSM. For SVM, results yielded when applying our model are typically lower than those obtained when using VSM. However, the gain in terms of time and dimensionality reduction is so promising since they are dramatically decreased by the application of our model.
Keywords :
data mining; data structures; pattern classification; support vector machines; text analysis; classical VSM; classifier SVM; classifier support vector machines; dimensionality reduction; document representation; k-nearest neighbors; kNN; opinion mining; text classification tasks; vector space model; Accuracy; Classification algorithms; Feature extraction; Support vector machine classification; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications (AICCSA), 2013 ACS International Conference on
Conference_Location :
Ifrane
ISSN :
2161-5322
Type :
conf
DOI :
10.1109/AICCSA.2013.6616499
Filename :
6616499
Link To Document :
بازگشت