Title :
Categorization of product pages depending on information on the Web
Author :
Sato, Naoto ; Komiya, Kanako ; Fujimoto, Koji ; Kotani, Yoshiyuki
Author_Institution :
Grad. Sch. of Eng., Tokyo Univ. of Agric. & Technol., Koganei, Japan
Abstract :
In this paper, the authors categorize product pages on the Web depending on their information. We used naive Bayes and the complement naive Bayes classifier, and tried four kinds of features to categorize them: all the words of the titles of the product pages, the nouns extracted from the titles, all the words of the titles and the descriptions of the product pages, and the nouns extracted from them. The experiments show that the product pages can be classified most correctly depending on only the nouns of the titles of the product pages. Moreover the complement naive Bayes classifier outperformed the naive Bayes classifier.
Keywords :
Bayes methods; Internet; electronic commerce; pattern classification; Internet; information network; naive Bayes classifier; product pages categorization; Categorization; Complement Naive Bayes; Decision Support System; Internet Auction; Natural Language Processing;
Conference_Titel :
Computer Science and Software Engineering (JCSSE), 2011 Eighth International Joint Conference on
Conference_Location :
Nakhon Pathom
Print_ISBN :
978-1-4577-0686-8
DOI :
10.1109/JCSSE.2011.5930153