Greedy list intersection

Robert Krauthgamer; Aranyak Mehta; Vijayshankar Raman; Atri Rudra

doi:10.1109/ICDE.2008.4497512

ICDE 2008

Conference paper

01 Oct 2008

Greedy list intersection

View publication

Abstract

A common technique for processing conjunctive queries is to first match each predicate separately using an index lookup, and then compute the intersection of the resulting rowid lists, via an AND-tree. The performance of this technique depends crucially on the order of lists in this tree: it is important to compute early the intersections that will produce small results. But this optimization is hard to do when the data or predicates have correlation. We present a new algorithm for ordering the lists in an AND-tree by sampling the intermediate intersection sizes. We prove that our algorithm is near-optimal and validate its effectiveness experimentally on datasets with a variety of distributions. © 2008 IEEE.

Conference paper