Learning to estimate query difficulty: Including applications to missing content detection and distributed information retrieval

Elad Yom-Tov; Shai Fine; David Carmel; Adam Darlow

doi:10.1145/1076034.1076121

SIGIR 2005

Conference paper

01 Dec 2005

Learning to estimate query difficulty: Including applications to missing content detection and distributed information retrieval

View publication

Abstract

In this article we present novel learning methods for estimating the quality of results returned by a search engine in response to a query. Estimation is based on the agreement between the top results of the full query and the top results of its sub-queries. We demonstrate the usefulness of quality estimation for several applications, among them improvement of retrieval, detecting queries for which no relevant content exists in the document collection, and distributed information retrieval. Experiments on TREC data demonstrate the robustness and the effectiveness of our learning algorithms. © 2005 ACM.

Paper