DocumentCode :
3081344
Title :
Online Library Content Generation Using Focused Crawling Based Upon Meta Tags and Tf-Idf
Author :
Kumar, Manoj ; Vig, Renu
Author_Institution :
Inst. of Eng. & Technol., Panjab Univ., Chandigarh, India
fYear :
2013
fDate :
24-26 Aug. 2013
Firstpage :
158
Lastpage :
161
Abstract :
An electronic library is a collection of digital information related to an individual domain and, in turn, to all domains. A focused crawler traverses the Web looking for the pages most relevant to a domain while discarding irrelevant pages, and is therefore helpful for generating e-content for a digital library related to a particular domain. In this paper, a focused crawling technique to generate online content for an e-library is proposed. The applicability of the proposed approach is shown by retrieving documents that are highly related to a single domain. The quality of the pages included in the library is derived from a relevancy measure of the page against the content of domain-related pages.
Keywords :
Internet; digital libraries; information retrieval; search engines; Tf-Idf; World Wide Web; digital information; digital library; document retrieval; domain related pages; e-content generation; e-library; electronic library; focused crawling; meta tags; online content generation; online library content generation; Crawlers; Indexes; Libraries; Marine animals; Search engines; Semantics; Web sites; Focused Web crawler; Tf-Idf; indexing; information retrieval; search engine; semantics
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational and Business Intelligence (ISCBI), 2013 International Symposium on
Conference_Location :
New Delhi
Print_ISBN :
978-0-7695-5066-4
Type :
conf
DOI :
10.1109/ISCBI.2013.73
Filename :
6724344