• DocumentCode
    2989084
  • Title

    The Improved Naive Bayesian WEB Text Classification Algorithm

  • Author

    Bai, Ping ; Li, Junqing

  • Author_Institution
    Bus. Adm. Coll., WuHan Univ. of Sci. & Eng., Wuhan, China
  • fYear
    2009
  • fDate
    18-20 Jan. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    It is a very important task that how to classify Web pages automatically and effectively in accordance with the given model for machine learning. The traditional operation modes, including artificial way and semiautomatic way, form category abstracts after domain experts´ personnel inspection and then put the results into a particular class library according to the scheduled requirements. An improved naive Bayesian Web text classification algorithm is proposed in this paper. The common Bayesian classifier assumes that all the items are equally important while in this paper the terms in each title are considered to be more important than others. Experiments showed that, the improved naive Bayesian algorithm is more precise in the text classification.
  • Keywords
    Bayes methods; information retrieval systems; learning (artificial intelligence); text analysis; Web pages; category abstracts; class library; domain expert personnel inspection; machine learning; naive Bayesian Web text classification algorithm; Abstracts; Bayesian methods; Classification algorithms; Inspection; Libraries; Machine learning; Machine learning algorithms; Personnel; Text categorization; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Network and Multimedia Technology, 2009. CNMT 2009. International Symposium on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-5272-9
  • Type

    conf

  • DOI
    10.1109/CNMT.2009.5374684
  • Filename
    5374684