Hi, I'm Mansi Phute

CS PhD at Georgia Tech
My research interests are Responsible AI and ML safety. I work on developing explanations for ML systems, analyzing them to identify vulnerabilities, and finding solutions to mitigate these issues. My UNDREAM system system offers a way to bridge differentiable rendering and photorealistic simulation for end-to-end adversarial attacks, thus enabling beter transferability of attacks to the physical world. My work includes LLM Self Defense, which leverages the model's own understanding of harm to protect itself from attacks.
I am currently a PhD student at Georgia Tech advised by Polo Chau as a part of the Polo Club of Data Science.
I have collaborated with designers, developers, and scientists at Intel Labs, Nanyang Technological University, and Dassault Systems.

Featured Publications

Bridging Differentiable Rendering and Photorealistic Simulation for End-to-end Adversarial Attacks
arXiv, 2025
A Large-Scale Dataset for Testing Robustness of Image Classifiers
NeurIPS, 2024
By Self Examination, LLMs Know They Are Being Tricked!
ICLR Tiny Paper, 2024