DocumentCode :
1908754
Title :
Subjectivity Classification of Filipino Text with Features Based on Term Frequency -- Inverse Document Frequency
Author :
Regalado, Ralph Vincent J. ; Chua, Jenina L. ; Co, Justin L. ; Tiam-Lee, Thomas James Z.
Author_Institution :
Center for Language Technol., De La Salle Univ., Manila, Philippines
fYear :
2013
fDate :
17-19 Aug. 2013
Firstpage :
113
Lastpage :
116
Abstract :
Subjectivity classification classifies a given document if it contains subjective information or not, or identifies which portions of the document are subjective. This research reports a machine learning approach on document-level and sentence-level subjectivity classification of Filipino texts using existing machine learning algorithms such as C4.5, Naïve Bayes, k-Nearest Neighbor, and Support Vector Machine. For the document-level classification, result shows that Support Vector Machines gave the best result with 95.06% accuracy. While for the sentence-level classification, Naïve Baves gave the best result with 58.75% accuracy.
Keywords :
learning (artificial intelligence); natural language processing; pattern classification; support vector machines; text analysis; C4.5 algorithm; Filipino text subjectivity classification; document-level subjectivity classification; k-nearest neighbor algorithm; machine learning approach; naive Bayes algorithm; sentence-level subjectivity classification; subjective document information; subjective information; support vector machine; term frequency-inverse document frequency; Accuracy; Buildings; Classification algorithms; Data models; Feature extraction; Machine learning algorithms; Support vector machines; Filipino language; TF-IDF; machine learning approach; subjectivity classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location :
Urumqi
Type :
conf
DOI :
10.1109/IALP.2013.40
Filename :
6646016
Link To Document :
بازگشت