DocumentCode :
1777052
Title :
Using neural network to combat with spam pages
Author :
Shahbazi, Moein ; Bidoki, Ali Mohammad Zareh ; Birgan, Meysam Shehni
Author_Institution :
Comput. Eng. Dept., Yazd Univ., Yazd, Iran
fYear :
2014
fDate :
29-30 Oct. 2014
Firstpage :
387
Lastpage :
392
Abstract :
Search engines resolve the most informational needs of users by indexing huge amounts of data from web pages. In this process, spam pages prevent users from reaching their desirable results. Spam pages use deceptive methods to get a higher rank than their real one in search engines. For a human expert, recognition of spam pages is an easy task, but it is too complicated for a machine. Regarding the large size of the web graph and the large number of web pages, leaving the whole task to human is impossible. As a solution to this challenge, in this paper we propose a semiautomatic method using a combinational ranking based on links between pages. At first, two valid sets of spam pages are specified by experts. Then a multilayered neural network trained by genetic algorithm will be used to calculate a global rank for the web graph. In this network the rank of spam pages is low. To evaluate the precision and performance of the proposed method, the Persian web graph corpus indexed by Parsijoo search engine is used. Experimental results show a better performance in comparison to other methods.
Keywords :
Internet; genetic algorithms; neural nets; search engines; unsolicited e-mail; Parsijoo search engine; Web graph; Web pages; combinational ranking; genetic algorithm; multilayered neural network; search engines; semiautomatic method; spam pages; Biological neural networks; Genetic algorithms; Sociology; Statistics; Training; Vectors; Search engines; neural network; ranking; spam pages; web graph;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Knowledge Engineering (ICCKE), 2014 4th International eConference on
Conference_Location :
Mashhad
Print_ISBN :
978-1-4799-5486-5
Type :
conf
DOI :
10.1109/ICCKE.2014.6993427
Filename :
6993427
Link To Document :
بازگشت