An open-source toolkit for debugging AI models of all data types | Technical note | Kevin Eykholt and Taesung Lee | 08 Sep 2023 | Adversarial Robustness and Privacy, AI Testing, Data and AI Security
Did an AI write that? If so, which one? Introducing the new field of AI forensics | Explainer | Kim Martineau | 24 Jul 2023 | Adversarial Robustness and Privacy, AI, Explainable AI, Foundation Models, Generative AI, Trustworthy AI
Manipulating stock prices with an adversarial tweet | Research | Kim Martineau | 13 Jul 2022 | Adversarial Robustness and Privacy, Trustworthy AI
Securing AI systems with adversarial robustness | Deep Dive | Pin-Yu Chen | 15 Dec 2021 | 8 minute read | Adversarial Robustness and Privacy, AI, Data and AI Security
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents | Ivoline Ngong, Swanand Ravindra Kadhe, et al. | 2025 | ACL 2025
In-Context Bias Propagation in LLM-Based Tabular Data Generation | Pol Garcia Recasens, Alberto Gutierrez-torre, et al. | 2025 | ICML 2025
MAD-MAX: Modular And Diverse Malicious Attack MiXtures for Automated LLM Red Teaming | Stefan Schoepf, Muhammad Zaid Hameed, et al. | 2025 | ICML 2025