MCDB-R: Risk analysis in the database

Subi Arumugam; Ravi Jampani; Luis L. Perez; Fei Xu; Christopher Jermaine; Peter J. Haas

doi:10.14778/1920841.1920941

VLDB

Paper

01 Jan 2010

Abstract

Enterprises often need to assess and manage the risk arising from uncertainty in their data. Such uncertainty is typically modeled as a probability distribution over the uncertain data values, specified by means of a complex (often predictive) stochastic model. The probability distribution over data values leads to a probability distribution over database query results, and risk assessment amounts to exploration of the upper or lower tail of a query-result distribution. In this paper, we extend the Monte Carlo Database System to efficiently obtain a set of samples from the tail of a query-result distribution by adapting recent "Gibbs cloning" ideas from the simulation literature to a database setting. © 2010 VLDB Endowment.
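To make the notion of a query-result tail concrete, here is a minimal sketch of the naive Monte Carlo baseline: draw many realizations of the uncertain data, evaluate an aggregate query on each, and keep the largest results. It is not the MCDB-R algorithm and does not implement Gibbs cloning. The normal uncertainty model, the SUM query, and every name in the code (`simulate_query_result`, `upper_tail_samples`, `means`, `tail_prob`) are assumptions made for illustration only.

```python
import random


def simulate_query_result(rng, means, stddev=1.0):
    """One Monte Carlo realization of a SUM query over uncertain values.

    Assumption: each uncertain value is an independent normal variable
    centered on its entry in `means` (a hypothetical stand-in for a
    stochastic model).
    """
    return sum(rng.gauss(m, stddev) for m in means)


def upper_tail_samples(means, n_samples=10_000, tail_prob=0.01, seed=0):
    """Naive tail sampling: simulate everything, keep the top fraction.

    Returns the empirical (1 - tail_prob) quantile and the query results
    at or above it.
    """
    rng = random.Random(seed)
    results = sorted(
        simulate_query_result(rng, means) for _ in range(n_samples)
    )
    k = max(1, round(n_samples * tail_prob))  # number of tail samples kept
    tail = results[-k:]
    return tail[0], tail


# Hypothetical expected values for three uncertain data items.
means = [100.0, 250.0, 75.0]
quantile, tail = upper_tail_samples(means)
print(f"empirical 99% quantile: {quantile:.2f}, tail samples: {len(tail)}")
```

In this sketch only `tail_prob` of the simulated results land in the tail, so the number of full query evaluations needed grows as the tail of interest becomes more extreme. That cost is what makes an efficient tail-sampling method attractive.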

Conference paper