Share Email Print

Proceedings Paper

Web caching and prefetching: a data mining approach
Author(s): Amidha Shyamsukha; Archana Sathaye; Arun Swami
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

With the increase in popularity of the Internet, the latency experienced by an individual, while accessing the Web, is increasing. In this paper, we investigate one approach to reducing latency by increasing the hit rate for a web cache. To this effect, we developed a predictive model for pre- fetching and a modified Least Recently Used (LRU) method called AssocLRU. This paper investigates the application of a data mining technique, called Association rules to the web domain. The association rules, predict the URLs a user might reference next, and this knowledge is used in our web caching and pre-fetching model. We developed a trace driven cache simulator to compare the performance of our predictive model with the widely used replacement policy, namely, LRU. The traces we used in our experiments were the traces of Web proxy activity taken at Virginia Tech and EPA HTTP. Our results show that our predictive pre-fetching model using association rules achieves a better hit rate than both LRU and AssocLRU.

Paper Details

Date Published: 27 March 2001
PDF: 8 pages
Proc. SPIE 4384, Data Mining and Knowledge Discovery: Theory, Tools, and Technology III, (27 March 2001); doi: 10.1117/12.421062
Show Author Affiliations
Amidha Shyamsukha, AltaVista Co. (United States)
Archana Sathaye, San Jose State Univ. (United States)
Arun Swami, Xerox Palo Alto Research Ctr. (United States)

Published in SPIE Proceedings Vol. 4384:
Data Mining and Knowledge Discovery: Theory, Tools, and Technology III
Belur V. Dasarathy, Editor(s)

© SPIE. Terms of Use
Back to Top