• DocumentCode
    685806
  • Title
    Extracting templates from Web pages
  • Author
    Manjula, R. ; Chilambuchelvan, A.
  • Author_Institution
    Dept. of CSE, R.M.K. Eng. Coll., Chennai, India
  • fYear
    2013
  • fDate
    12-14 Dec. 2013
  • Firstpage
    788
  • Lastpage
    791
  • Abstract
    In today's world, the World Wide Web is the most popular information provider. A website is a collection of web pages, and web pages usually contain information for users. Websites are designed with common templates and content. Templates let users access content easily through consistent structures, even though the templates are not explicitly announced. Current template extraction techniques degrade the performance of web applications such as search engines because of irrelevant terms in templates. Hence, we present a new method for automatically detecting and extracting templates from web pages by identifying the relevant information.
  • Keywords
    Internet; search engines; Web pages; Web sites; World Wide Web; search engine; template extraction technique; Decision support systems; Erbium; Handheld computers; Document Object Model; Minimum Description Length; Template Extraction; VIPS;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Green Computing, Communication and Conservation of Energy (ICGCE), 2013 International Conference on
  • Conference_Location
    Chennai
  • Type
    conf
  • DOI
    10.1109/ICGCE.2013.6823541
  • Filename
    6823541