GitHub - csinva/hierarchical-dnn-interpretations: Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019) (original) (raw)

Hierarchical neural-net interpretations (ACD) 🧠

Produces hierarchical interpretations for a single prediction made by a pytorch neural network. Official code for Hierarchical interpretations for neural network predictions (ICLR 2019 pdf).

Documentation •Demo notebooks

Note: this repo is actively maintained. For any questions please file an issue.

examples/documentation

Inspecting NLP sentiment models Detecting adversarial examples Analyzing imagenet models

notes on using ACD on your own data

reference

@inproceedings{ singh2019hierarchical, title={Hierarchical interpretations for neural network predictions}, author={Chandan Singh and W. James Murdoch and Bin Yu}, booktitle={International Conference on Learning Representations}, year={2019}, url={https://openreview.net/forum?id=SkEqro0ctQ}, }