Adversarial Auditing of Machine Learning Models under Compound Shift

Karan Bhanot; Dennis Wei; Ioana Baldini Soares; Kristin Bennett

ESANN 2023

Conference paper

04 Oct 2023

Abstract

Machine learning (ML) models often perform differently under distribution shifts, in terms of utility, fairness, and other dimensions. We propose the Adversarial Auditor for measuring the utility and fairness performance of ML models under compound shifts of outcome and protected attributes. We use Multi-Objective Bayesian Optimization (MOBO) to account for multiple metrics and identify shifts where model performance is extreme, both good and bad. In two case studies, MOBO performed better than random and grid-based approaches at identifying such shift scenarios by adversarially optimizing the objectives, highlighting the value of such an auditor for developing fair, accurate, and shift-robust models.
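To make the idea of auditing under a compound shift concrete, the following is a minimal sketch, not the authors' implementation. It uses a plain grid search (one of the baselines named above) rather than MOBO, and it rests on several assumptions that do not come from the paper: the data are synthetic, the shift is parameterized by two hypothetical values `p_a` = P(A=1) and `p_y` = P(Y=1) treated as independent, the model's behavior within each (attribute, outcome) cell is assumed unchanged by the shift, utility is measured as accuracy, and fairness as the demographic parity gap. All function names are illustrative.

```python
import random
from itertools import product


def make_data(n=2000, seed=0):
    """Synthetic (a, y, yhat) rows; a hypothetical model that is less accurate for a=1."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        a = int(rng.random() < 0.5)   # protected attribute
        y = int(rng.random() < 0.4)   # true outcome
        p_correct = 0.9 if a == 0 else 0.7
        yhat = y if rng.random() < p_correct else 1 - y
        rows.append((a, y, yhat))
    return rows


def cell_stats(rows):
    """Rate of positive predictions within each (a, y) cell."""
    stats = {}
    for a, y in product((0, 1), repeat=2):
        cell = [r for r in rows if r[0] == a and r[1] == y]
        stats[(a, y)] = sum(r[2] for r in cell) / len(cell)
    return stats


def audit(stats, p_a, p_y):
    """Accuracy and demographic parity gap under a shift to P(A=1)=p_a, P(Y=1)=p_y.

    Assumes A and Y are independent after the shift and that the per-cell
    prediction rates stay fixed (both are simplifying assumptions).
    """
    acc = 0.0
    pos_rate = {}
    for a in (0, 1):
        w_a = p_a if a == 1 else 1 - p_a
        # P(yhat=1 | A=a), mixing the two outcome cells by the shifted P(Y)
        pos_rate[a] = (1 - p_y) * stats[(a, 0)] + p_y * stats[(a, 1)]
        # Group accuracy: true positives plus true negatives
        acc_a = p_y * stats[(a, 1)] + (1 - p_y) * (1 - stats[(a, 0)])
        acc += w_a * acc_a
    return acc, abs(pos_rate[1] - pos_rate[0])


def pareto_front(points):
    """Worst-case front: keep shifts not dominated by one with lower accuracy and a larger gap."""
    front = []
    for p in points:
        dominated = any(
            q["acc"] <= p["acc"] and q["gap"] >= p["gap"]
            and (q["acc"] < p["acc"] or q["gap"] > p["gap"])
            for q in points
        )
        if not dominated:
            front.append(p)
    return front


stats = cell_stats(make_data())
grid = [i / 10 for i in range(1, 10)]
results = []
for p_a, p_y in product(grid, grid):
    acc, gap = audit(stats, p_a, p_y)
    results.append({"p_a": p_a, "p_y": p_y, "acc": acc, "gap": gap})

front = pareto_front(results)
for point in sorted(front, key=lambda r: r["acc"]):
    print(point)
```

Each printed point is a shift scenario in which the model is as bad as possible on one metric without any other scenario being worse on both. A Bayesian optimizer would replace the exhaustive grid with a surrogate-guided search over the same `(p_a, p_y)` space, and flipping the dominance test would recover the best-case scenarios instead.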
