DocumentCode
2229573
Title
Linguistic Evaluation in the Classification in Portuguese Texts
Author
Camargo, Yuri ; Mello, Laila ; Leão, Jorge L S
Author_Institution
GTA - Grupo de Teleinformatica e Automacao - COPPE/UFRJ, Rio de Janeiro
fYear
2007
fDate
20-24 Oct. 2007
Firstpage
531
Lastpage
538
Abstract
This paper evaluates the performance of support vector machines, Naive Bayes, and neural networks as classifiers for the categorization of Portuguese texts. We present several experiments on two different corpora using different feature selection strategies, and we consider the use of linguistic information in the definition of grammatical groups. A comparison of the classifiers is presented, and the error margins show excellent results when a specific feature selection is used in association with the right classifier.
Keywords
Bayes methods; natural language processing; neural nets; pattern classification; support vector machines; text analysis; Naive Bayes; Portuguese text categorization; Portuguese texts; feature selection strategies; linguistic classification; linguistic evaluation; linguistic information; neural networks; Data mining; Information analysis; Intelligent networks; Intelligent systems; Machine intelligence; Neural networks; Nominations and elections; Support vector machine classification; Support vector machines; Text categorization
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligent Systems Design and Applications, 2007. ISDA 2007. Seventh International Conference on
Conference_Location
Rio de Janeiro
Print_ISBN
978-0-7695-2976-9
Type
conf
DOI
10.1109/ISDA.2007.154
Filename
4389662