• DocumentCode
    3241495
  • Title
    Genetic Algorithm Based to Improve HTML Document Retrieval
  • Author
    Al-Dallal, Ammar ; Abdul-Wahab, Rasha S.
  • Author_Institution
    Sch. of Inf. Syst. Comput. & Math., Brunel Univ., Uxbridge, UK
  • fYear
    2009
  • fDate
    14-16 Dec. 2009
  • Firstpage
    343
  • Lastpage
    348
  • Abstract
    This paper describes GAHWM, a new evolutionary algorithm that integrates the genetic algorithm paradigm with an inverted index model to mine the content of HTML documents for effective Web document retrieval. The method is reported to be superior in terms of recall and precision across various real-life datasets.
  • Keywords
    Internet; data mining; genetic algorithms; hypermedia markup languages; information retrieval; GAHWM; HTML Web content mining; HTML document retrieval; Web document retrieval; evolutionary algorithm; genetic algorithm; inverted index model; Biological cells; Content based retrieval; Data mining; Evolutionary computation; Genetic algorithms; HTML; Information retrieval; Search engines; Web mining; Web pages; AI; Genetic Algorithm; Inverted Index; Web Mining;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Developments in eSystems Engineering (DESE), 2009 Second International Conference on
  • Conference_Location
    Abu Dhabi
  • Print_ISBN
    978-1-4244-5401-3
  • Electronic_ISBN
    978-1-4244-5402-0
  • Type
    conf
  • DOI
    10.1109/DeSE.2009.57
  • Filename
    5395140