Explaining knock-on effects of bias mitigation. Svetoslav Nizhnichenkov, Rahul Nair, et al. NeurIPS 2023.
Contrastive Explanations for Comparing Preferences of Reinforcement Learning. Jasmina Gajcin, Rahul Nair, et al. AAAI 2022.
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI. Ambrish Rawat, Stefan Schoepf, et al. NeurIPS 2024.
Designing Machine Learning Pipeline Toolkit for AutoML Surrogate Modeling Optimization. Paulito Palmes, Akihiro Kishimoto, et al. JuliaCon 2023.