DocumentCode
2486635
Title
Improvement of Feature Extraction in Web Page Classification
Author
Jiao Lijuan ; Feng Liping
Author_Institution
Dept. of Comput. Sci., Xinzhou Teachers Univ., Xinzhou, China
fYear
2010
fDate
22-23 May 2010
Firstpage
1
Lastpage
3
Abstract
Mutual information formula is improved by using the hyperlink factor in this paper. Introduction of hyperlink elements of web pages can improve the classification accuracy in feature selection method based on mutual information and correlation by experiment, especially for those of strong. So the improvement is effective in web page classification.
Keywords
Web sites; classification; feature extraction; Web page classification; feature extraction; feature selection; hyperlink factor; Computer science; Data mining; Electronic mail; Feature extraction; Frequency; Mutual information; Optimization methods; Relational databases; Text categorization; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
e-Business and Information System Security (EBISS), 2010 2nd International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-5893-6
Electronic_ISBN
978-1-4244-5895-0
Type
conf
DOI
10.1109/EBISS.2010.5473682
Filename
5473682
Link To Document