Robust Machine Learning
How Well do Feature Visualizations Support Causal Understanding of CNN Activations?
We introduce a new psychophysical task to measure how well visualizations help humans predict the causal effects of interventions on unit activations, finding that widely used feature visualizations provide no significant advantage over simpler alternatives such as dataset samples in fostering causal understanding.