DocumentCode
2215780
Title
WESPACT: — Detection of web spamdexing with decision trees in GA perspective
Author
Jayanthi, S.K. ; Sasikala, S.
Author_Institution
Comput. Sci. Dept., Vellalar Coll. for Women, Erode, India
fYear
2012
fDate
21-23 March 2012
Firstpage
381
Lastpage
386
Abstract
Internet today is huge, dynamic, self-organized, and strongly interlinked. Web spam can significantly worsen the quality of search engine results. The motivation of the paper is based on the logical perspective of approaching the web spam problem as cancer caused to the internet, and the solution could be derived by formulating the algorithms based on genetic algorithm (GA) based on content and link attributes. Web mining tools GATree [15] and PermutMatrix [14] has been used to simulate the experiments. JAVA is used to develop program that analyze and report the spamdexing instance. This paper proposes an algorithm WESPACT, to detect the web spam. This algorithm performs well as shown through experiments.
Keywords
Internet; Java; data mining; decision trees; genetic algorithms; search engines; security of data; unsolicited e-mail; GA; GA perspective; Internet; JAVA; PermutMatrix; WESPACT; Web mining tools GATree; Web spamdexing detection; content attributes; decision trees; genetic algorithm; link attributes; search engine result quality; Cancer; Classification algorithms; Computer aided software engineering; Genetic algorithms; Informatics; Pattern recognition; Search engines; Cancer; Content spam; Link spam; Search engine; WESPACT;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, Informatics and Medical Engineering (PRIME), 2012 International Conference on
Conference_Location
Salem, Tamilnadu
Print_ISBN
978-1-4673-1037-6
Type
conf
DOI
10.1109/ICPRIME.2012.6208376
Filename
6208376
Link To Document