DocumentCode
3103314
Title
Descriptive words for small Web collections
Author
Deepak, P. ; John, Jyothi
Author_Institution
Model Eng. Coll., Kochi, India
fYear
2004
fDate
19-23 April 2004
Firstpage
633
Lastpage
634
Abstract
This paper deals with the problem of identifying the subject of small, sparsely linked collections of Web documents (Web communities). Attempts to solve many Web-related problems often leave us with a handful of pages that deal with a common topic but have very few links among them. This paper presents algorithms that work on such collections and output a set of words descriptive of the collection, in decreasing order of relevance. The set of most relevant words, which can aptly be called the "subject set", provides a close approximation of the topic that the collection deals with. The subject set of the first few results from a Web search could be used to refine that search further. It could also greatly simplify the Web search process by indexing Web communities. It could likewise be used in parental monitoring systems, where the subject set of the pages browsed by a child could indicate the purpose of the child's Web usage.
Keywords
Internet; information retrieval; search engines; text analysis; Web collection; Web document; Web search; descriptive word; indexing; page browsing; parental monitoring system; Data mining; Educational institutions; Frequency; Indexing; Monitoring; Testing; Text analysis; Web pages; Web search; Web services;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on
Print_ISBN
0-7803-8482-2
Type
conf
DOI
10.1109/ICTTA.2004.1307924
Filename
1307924