Publication Highlights

Interpretability & Infrastructure

Captum: A unified and generic model interpretability library for PyTorch
Narine Kokhlikyan, Vivek Miglani, Miguel Martin, Edward Wang, Bilal Alsallakh, Jonathan Reynolds, Alexander Melnikov, Natalia Kliushkina, Carlos Araya, Siqi Yan, Orion Reblitz-Richardson
arXiv preprint, 2020
Investigating Saturation Effects in Integrated Gradients
Vivek Miglani, Narine Kokhlikyan, Bilal Alsallakh, Orion Reblitz-Richardson
Preprint, 2020

Alignment Research

In progress.