MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
- 2024
- AIES 2024
Mark manages the AI Security & Privacy team in the Dublin Research Lab. The research topics for the team are the Security of Generative AI (guardrails, red-teaming) and Privacy Enhancing Technologies (PII detection, risk assessment, Differential Privacy).