Title :
Towards compressing Web graphs
Author :
Adler, Micah ; Mitzenmacher, Michael
Author_Institution :
Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
Abstract :
We consider the problem of compressing graphs of the link structure of the World Wide Web. We provide efficient algorithms for such compression that are motivated by random graph models for describing the Web. The algorithms are based on reducing the compression problem to the problem of finding a minimum spanning free in a directed graph related to the original link graph. The performance of the algorithms on graphs generated by the random graph models suggests that by taking advantage of the link structure of the Web, one may achieve significantly better compression than natural Huffman-based schemes. We also provide hardness results demonstrating limitations on natural extensions of our approach
Keywords :
computational complexity; data compression; directed graphs; information resources; Huffman-based schemes; NP-hard problems; Web graphs compression; World Wide Web; algorithms; directed graph; efficient algorithms; hardness results; link structure; minimum spanning free; performance; random graph models; Compression algorithms; Computer science; Electronic mail; Engineering profession; Prototypes; Search engines; Testing; Tree graphs; Web pages; Web sites;
Conference_Titel :
Data Compression Conference, 2001. Proceedings. DCC 2001.
Conference_Location :
Snowbird, UT
Print_ISBN :
0-7695-1031-0
DOI :
10.1109/DCC.2001.917151