Fixed-precision estimation of join selectivity
Peter J. Hass, Jeffrey F. Naughton, et al.
SIGMOD/PODS/ 1993
Popular web sites are already experiencing very heavy loads, and these loads will only increase as the number of users accessing them grows. These loads create both CPU and I/O bottlenecks. One promising solution already being employed to eliminate the CPU bottleneck is to replace a single processor server with a cluster of servers. Our goal in this paper is to develop buffer management algorithms that exploit the aggregate memory capacity of the machines in such a server cluster to attack the I/O bottleneck. The key challenge in designing such buffer management algorithms turns out to be controlling data replication so as to achieve a good balance between intra-cluster network traffic and disk I/O. At one extreme, the straightforward application of client-server memory management techniques to this cluster architecture causes duplication in memory among the servers and this tends to reduce network traffic but increases disk I/O, whereas at the other extreme, eliminating all duplicates tends to increase network traffic while reducing disk I/O. Accordingly, we present a new algorithm, Hybrid, that dynamically controls the amount of duplication. Through a detailed simulation, we show that on workloads characteristic of those experienced by Web servers, the Hybrid algorithm correctly trades off intra-cluster network traffic and disk I/O to minimize average response time.
Peter J. Hass, Jeffrey F. Naughton, et al.
SIGMOD/PODS/ 1993
H. Pirahesh, T.Y.Cliff Leung, et al.
ICDE 1997
P.J. Haas, Jeffrey F. Naughton, et al.
SIGMOD/PODS/ 1994
R. Agrawal, A. Gupta, et al.
ICDE 1997