An invisible watermark to keep tabs on tabular data. Research blog by Kim Martineau, 19 May 2025. Topics: Adversarial Robustness and Privacy, AI, Generative AI, Trustworthy Generation.
Teaching AI models to improve themselves. Research blog by Peter Hess, 14 Aug 2024. Topics: AI, Computer Science, Explainable AI, Generative AI, Natural Language Processing, Trustworthy AI, Trustworthy Generation.
What is retrieval-augmented generation? Explainer by Kim Martineau, 22 Aug 2023. Topics: AI, Explainable AI, Generative AI, Natural Language Processing, Trustworthy Generation.
Accelerating molecular optimization with AI. Deep dive by Payel Das, Samuel Hoffman, Vijil Chenthamarakshan, Kahini Wadhawan, and Pin-Yu Chen, 08 Feb 2022. Topics: Accelerated Discovery, Generative AI, Healthcare, Materials Discovery, Trustworthy AI, Trustworthy Generation.
Miriam Rateike, Brian Mboya, et al. "3rd TrustAI Workshop: Building Public Awareness and Engagement." DLI 2025.
Chen Xiong, Xiangyu Qi, et al. "Defensive Prompt Patch: A Robust and Generalizable Defense of Large Language Models against Jailbreak Attacks." ACL 2025.
Megh Thakkar, Quentin Fournier, et al. "Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMs." ACL 2025.
George Kour, Itay Nakash, et al. "Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models." ACL 2025.
Matthew Riemer, Zahra Ashktorab, et al. "Position: Theory of Mind Benchmarks are Broken for Large Language Models." ICML 2025.
Hongli Zhan, Muneeza Azmat, et al. "SPRI: Aligning Large Language Models with Context-Situated Principles." ICML 2025.