Publications

(2024). Generalized Group Data Attribution. Preprint.

PDF Cite

(2024). In-Context Explainers: Harnessing LLMs for Explaining Black Box Models. Preprint.

PDF Cite

(2024). On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models. In ICML Workshop on Trustworthy Multi-modal Foundation Models and AI Agents (TiFA).

PDF Cite

(2023). On Minimizing the Impact of Dataset Shifts on Actionable Explanations. In UAI (Oral, Top 5%).

PDF Cite

(2023). Consistent Explanations in the Face of Model Indeterminacy via Ensembling. In ICML Workshop on Interpretable Machine Learning in Healthcare.

PDF Cite

(2023). GLOBE-CE: A Translation-Based Approach for Global Counterfactual Explanations. In ICML.

PDF Cite Poster

(2022). OpenXAI: Towards a Transparent Evaluation of Model Explanations. In NeurIPS.

PDF Cite

(2022). Global Counterfactual Explanations: Investigations, Implementations and Improvements. In ICLR Workshop on Privacy, Accountability, Interpretability, Robustness, Reasoning on Structured Data.

PDF Cite Poster