3rd TrustAI Workshop: Building Public Awareness and EngagementMiriam RateikeBrian Mboyaet al.2025DLI 2025
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language ModelsGeorge KourItay Nakashet al.2025ACL 2025
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional ReasoningZheyuan ZhangYiyang Liet al.2025ACL 2025
Multi-Level Explanations for Generative Language ModelsLucas Monteiro PaesDennis Weiet al.2025ACL 2025
BI-Bench : A Comprehensive Benchmark Dataset and Unsupervised Evaluation for BI SystemsAnkush GuptaAniya Aggarwalet al.2025ACL 2025
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIvoline NgongSwanand Ravindra Kadheet al.2025ACL 2025
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the CommunityShachar Don-YehiyaLeshem Choshenet al.2025ACL 2025
Avoiding Leakage Poisoning: Concept Interventions Under Distribution ShiftsMateo Espinosa ZarlengaGabriele Dominiciet al.2025ICML 2025
ConceptAttention: Diffusion Transformers Learn Highly Interpretable FeaturesAlec HelblingTuna Meralet al.2025ICML 2025
Learning interpretable positional encodings in transformers depends on initializationTaku ItoLuca Cocchiet al.2025ICML 2025