DocumentCode
3241495
Title
Genetic Algorithm Based to Improve HTML Document Retrieval
Author
Al-Dallal, Ammar ; Abdul-Wahab, Rasha S.
Author_Institution
Sch. of Inf. Syst. Comput. & Math., Brunel Univ., Uxbridge, UK
fYear
2009
fDate
14-16 Dec. 2009
Firstpage
343
Lastpage
348
Abstract
This paper describes GAHWM, a new evolutionary algorithm that integrates the genetic algorithm paradigm with an inverted index model to mine the content of HTML documents for effective Web document retrieval. The method is reported to be superior in terms of recall and precision across various real-life datasets.
Keywords
Internet; data mining; genetic algorithms; hypermedia markup languages; information retrieval; GAHWM; HTML Web content mining; HTML document retrieval; Web document retrieval; evolutionary algorithm; genetic algorithm; inverted index model; Biological cells; Content based retrieval; Data mining; Evolutionary computation; Genetic algorithms; HTML; Information retrieval; Search engines; Web mining; Web pages; AI; Genetic Algorithm; Inverted Index; Web Mining
fLanguage
English
Publisher
IEEE
Conference_Titel
Developments in eSystems Engineering (DESE), 2009 Second International Conference on
Conference_Location
Abu Dhabi
Print_ISBN
978-1-4244-5401-3
Electronic_ISBN
978-1-4244-5402-0
Type
conf
DOI
10.1109/DeSE.2009.57
Filename
5395140