• DocumentCode
    3081344
  • Title

    Online Library Content Generation Using Focused Crawling Based Upon Meta Tags and Tf-Idf

  • Author

    Kumar, Manoj ; Vig, Renu

  • Author_Institution
    Inst. of Eng. & Technol., Panjab Univ., Chandigarh, India
  • fYear
    2013
  • fDate
    24-26 Aug. 2013
  • Firstpage
    158
  • Lastpage
    161
  • Abstract
    Electronic library is the collection of digital information related to an individual domain and in turn to all domains. A focused crawler traverses the Web looking for the pages most relevant to a domain and at the same time discarding the irrelevant pages and hence is helpful for generating the-e contents for digital library related to a particular domain. In this paper a focused crawling technique to generate online contents for e-library is proposed. The applicability of the proposed approach is shown by retrieving the documents which are highly related to a single domain. The quality of the pages included into the library is derived from the relevancy measure of the page with the content of domain related pages.
  • Keywords
    Internet; digital libraries; information retrieval; search engines; Tf-Idf; World Wide Web; digital information; digital library; document retrieval; domain related pages; e-content generation; e-library; electronic library; focused crawling; meta tags; online content generation; online library content generation; Crawlers; Indexes; Libraries; Marine animals; Search engines; Semantics; Web sites; Focused Web crawler; Tf-Idf; indexing.; information retrieval; search engine; semantics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational and Business Intelligence (ISCBI), 2013 International Symposium on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-0-7695-5066-4
  • Type

    conf

  • DOI
    10.1109/ISCBI.2013.73
  • Filename
    6724344