Title :
A cascade mining algorithm based on Chinese keywords web mining
Author :
Zhou, Xueguang ; Zhang, Huanguo
Author_Institution :
Coll. of Electron. Eng., Naval Univ. of Eng., Wuhan
Abstract :
Security content filtering of the World Wide Web is an important task in network security. Keyword-based Web mining commonly suffers from low precision, especially when ill-intentioned users actively disguise text to deceive and bypass filters. To filter these few but deliberately malicious Web pages, the first step is classifier design. A cascade mining algorithm is therefore proposed, consisting of one cascade classifier operator and three mining components: a jamming mining component, a Bopomofo mining component, and a complicated characters mining component. After a Web page has been mined by the three components, the cascade classifier operator processes the intermediate mining results and produces the final result, classifying the page as either normal or malicious. Experiments and analysis demonstrate that the cascade mining algorithm can classify Chinese Web pages simply, effectively, and precisely.
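The abstract gives only the overall structure (three mining components feeding one cascade classifier operator), so the following Python sketch is an illustration under stated assumptions, not the authors' implementation. It assumes that the jamming component strips interference characters inserted between keyword characters, that the Bopomofo component maps Zhuyin phonetic spellings back to Chinese characters, that "complicated characters" refers to traditional Chinese characters, and that the cascade operator flags a page as soon as any stage reaches a hit threshold. The keyword list, lookup tables, function names, and threshold are all hypothetical.

```python
import re

# Hypothetical keyword list and lookup tables (tiny, for illustration only);
# the abstract does not specify the real ones.
KEYWORDS = ["赌博"]  # "gambling"
TRAD_TO_SIMP = {"賭": "赌"}  # traditional -> simplified character
BOPOMOFO_TO_HANZI = {"ㄉㄨˇ": "赌", "ㄅㄛˊ": "博"}  # Zhuyin syllable -> character


def count_hits(text):
    """Count occurrences of any keyword in the text."""
    return sum(text.count(keyword) for keyword in KEYWORDS)


def jamming_component(text):
    """Assumed behaviour: drop every character that is not a CJK ideograph,
    so that 'jammed' keywords such as '赌*博' become contiguous again."""
    cleaned = re.sub(r"[^\u4e00-\u9fff]", "", text)
    return count_hits(cleaned)


def bopomofo_component(text):
    """Assumed behaviour: replace known Bopomofo syllables with characters."""
    for syllable, hanzi in BOPOMOFO_TO_HANZI.items():
        text = text.replace(syllable, hanzi)
    return count_hits(text)


def complicated_component(text):
    """Assumed behaviour: map traditional characters to simplified ones."""
    simplified = "".join(TRAD_TO_SIMP.get(ch, ch) for ch in text)
    return count_hits(simplified)


def cascade_classify(text, threshold=1):
    """Assumed cascade rule: run the stages in order and stop at the first
    stage whose hit count reaches the threshold."""
    stages = (jamming_component, bopomofo_component, complicated_component)
    for stage in stages:
        if stage(text) >= threshold:
            return "malicious"
    return "normal"
```

Under these assumptions, a page containing a keyword disguised by an inserted symbol, written in Bopomofo, or written in traditional characters is caught by the first, second, or third stage respectively, while a page with no keyword in any form passes through all three stages and is labelled normal.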
Keywords :
Internet; data mining; information filtering; security of data; Bopomofo mining component; Chinese keywords Web mining; World Wide Web; cascade mining algorithm; classifier design; jamming mining component; malicious Web pages; network security; security content filtering; Filtering algorithms; Information filters; Information retrieval; Information security; Jamming; Uniform resource locators; Web mining; Web pages; complicated characters mining component
Conference_Title :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
DOI :
10.1109/WCICA.2008.4593577