• DocumentCode
    2133179
  • Title

    Improving text categorization by resolving semantic ambiguity

  • Author

    Uejima, Hiroshi ; Miura, Tsuyoshi ; Shioya, Lsamu

  • Author_Institution
    Dept. of Elect. & Elect. Engr., Hosei Univ., Tokyo, Japan
  • Volume
    2
  • fYear
    2003
  • fDate
    28-30 Aug. 2003
  • Firstpage
    796
  • Abstract
    In this investigation, we propose a new method for text categorization (TC) based on Bayesian approach by resolving ambiguity. The TC assumes weights to words of which meanings are ambiguous in a sense of synonymy and polysemy. We give weights to articles by examining dictionaries of thesaurus and of dimensionality reduction to improve the quality of TC. Also we show some experiments to illustrate how well our approach goes.
  • Keywords
    dictionaries; text analysis; thesauri; word processing; Bayesian approach; dimensionality reduction; polysemy; semantic ambiguity resolving; text categorization quality improvement; text mining; thesaurus dictionary; Bayesian methods; Data mining; Dictionaries; Humans; Informatics; Information retrieval; Routing; Stress; Text categorization; Thesauri;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Computers and signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
  • Print_ISBN
    0-7803-7978-0
  • Type

    conf

  • DOI
    10.1109/PACRIM.2003.1235901
  • Filename
    1235901