• DocumentCode
    2229573
  • Title

    Linguistic Evaluation in the Classification in Portuguese Texts

  • Author

    Camargo, Yuri ; Mello, Laila ; Leão, Jorge L S

  • Author_Institution
    GTA- Grupo de Teleinformatica e Automacao - COPPE/UFRJ, Rio de Janeiro
  • fYear
    2007
  • fDate
    20-24 Oct. 2007
  • Firstpage
    531
  • Lastpage
    538
  • Abstract
    This paper evaluates the performance of support vector machines, Naive Bayes, and neural networks as classifiers for the categorization of Portuguese texts. We present several experiments with two different corpora with different feature selection strategies. We consider the use of linguistic information in the definition of grammatical groups. A comparison of classifiers is presented and the error margins show excellent results when using a specific feature selection in association with the right classifier.
  • Keywords
    Bayes methods; natural language processing; neural nets; pattern classification; support vector machines; text analysis; Naive Bayes; Portuguese text categorization; Portuguese texts; feature selection strategies; linguistic classification; linguistic evaluation; linguistic information; neural networks; support vector machines; Data mining; Information analysis; Intelligent networks; Intelligent systems; Machine intelligence; Neural networks; Nominations and elections; Support vector machine classification; Support vector machines; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2007. ISDA 2007. Seventh International Conference on
  • Conference_Location
    Rio de Janeiro
  • Print_ISBN
    978-0-7695-2976-9
  • Type

    conf

  • DOI
    10.1109/ISDA.2007.154
  • Filename
    4389662