csinva / hierarchical-dnn-interpretations

Licence: MIT license

Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)

Programming Languages

Jupyter Notebook

11667 projects

python

139335 projects - #7 most used programming language

TeX

3793 projects

Projects that are alternatives of or similar to hierarchical-dnn-interpretations

deep-explanation-penalization

Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584

Stars: ✭ 110 (+0%)

Mutual labels: ml, interpretability, feature-importance, explainable-ai, explainability

zennit

Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP.

Stars: ✭ 57 (-48.18%)

Mutual labels: interpretability, explainable-ai, explainability

responsible-ai-toolbox

This project provides responsible AI user interfaces for Fairlearn, interpret-community, and Error Analysis, as well as foundational building blocks that they rely on.

Stars: ✭ 615 (+459.09%)

Mutual labels: ml, explainable-ai, explainability

Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Stars: ✭ 484 (+340%)

Mutual labels: interpretability, explainable-ai, explainability

mllp

The code of AAAI 2020 paper "Transparent Classification with Multilayer Logical Perceptrons and Random Binarization".

Stars: ✭ 15 (-86.36%)

Mutual labels: interpretability, explainable-ai, explainability

Interpret

Fit interpretable models. Explain blackbox machine learning.

Stars: ✭ 4,352 (+3856.36%)

Mutual labels: interpretability, explainable-ai, explainability

ProtoTree

ProtoTrees: Neural Prototype Trees for Interpretable Fine-grained Image Recognition, published at CVPR2021

Stars: ✭ 47 (-57.27%)

Mutual labels: interpretability, explainable-ai, explainability

concept-based-xai

Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI

Stars: ✭ 41 (-62.73%)

Mutual labels: interpretability, explainable-ai, explainability

Shap

A game theoretic approach to explain the output of any machine learning model.

Stars: ✭ 14,917 (+13460.91%)

Mutual labels: interpretability, explainability

Captum

Model interpretability and understanding for PyTorch

Stars: ✭ 2,830 (+2472.73%)

Mutual labels: interpretability, feature-importance

Xai

XAI - An eXplainability toolbox for machine learning

Stars: ✭ 596 (+441.82%)

Mutual labels: ml, interpretability

removal-explanations

A lightweight implementation of removal-based explanations for ML models.

Stars: ✭ 46 (-58.18%)

Mutual labels: interpretability, explainability

yggdrasil-decision-forests

A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models.

Stars: ✭ 156 (+41.82%)

Mutual labels: ml, interpretability

Imodels

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

Stars: ✭ 194 (+76.36%)

Mutual labels: ml, interpretability

Awesome Production Machine Learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

Stars: ✭ 10,504 (+9449.09%)

Mutual labels: interpretability, explainability

Mindsdb

Predictive AI layer for existing databases.

Stars: ✭ 4,199 (+3717.27%)

Mutual labels: ml, explainable-ai

adaptive-wavelets

Adaptive, interpretable wavelets across domains (NeurIPS 2021)

Stars: ✭ 58 (-47.27%)

Mutual labels: interpretability, explainability

thermostat

Collection of NLP model explanations and accompanying analysis tools

Stars: ✭ 126 (+14.55%)

Mutual labels: interpretability, explainability

ALPS 2021

XAI Tutorial for the Explainable AI track in the ALPS winter school 2021

Stars: ✭ 55 (-50%)

Mutual labels: interpretability, explainability

sage

For calculating global feature importance using Shapley values.

Stars: ✭ 129 (+17.27%)

Mutual labels: interpretability, explainability

View All Similar Projects ➔

Hierarchical neural-net interpretations (ACD) 🧠

Produces hierarchical interpretations for a single prediction made by a pytorch neural network. Official code for Hierarchical interpretations for neural network predictions (ICLR 2019 pdf).

Documentation • Demo notebooks

Note: this repo is actively maintained. For any questions please file an issue.

examples/documentation

installation: pip install acd (or clone and run python setup.py install)
examples: the reproduce_figs folder has notebooks with many demos
src: the acd folder contains the source for the method implementation
allows for different types of interpretations by changing hyperparameters (explained in examples)
all required data/models/code for reproducing are included in the dsets folder

Inspecting NLP sentiment models	Detecting adversarial examples	Analyzing imagenet models

notes on using ACD on your own data

the current CD implementation often works out-of-the box, especially for networks built on common layers, such as alexnet/vgg/resnet. However, if you have custom layers or layers not accessible in net.modules(), you may need to write a custom function to iterate through some layers of your network (for examples see cd.py).
to use baselines such build-up and occlusion, replace the pred_ims function by a function, which gets predictions from your model given a batch of examples.

related work

CDEP (ICML 2020 pdf, github) - penalizes CD / ACD scores during training to make models generalize better
TRIM (ICLR 2020 workshop pdf, github) - using simple reparameterizations, allows for calculating disentangled importances to transformations of the input (e.g. assigning importances to different frequencies)
PDR framework (PNAS 2019 pdf) - an overarching framewwork for guiding and framing interpretable machine learning
DAC (arXiv 2019 pdf, github) - finds disentangled interpretations for random forests
Baseline interpretability methods - the file scores/score_funcs.py also contains simple pytorch implementations of integrated gradients and the simple interpration technique gradient * input

reference

feel free to use/share this code openly
if you find this code useful for your research, please cite the following:

@inproceedings{
   singh2019hierarchical,
   title={Hierarchical interpretations for neural network predictions},
   author={Chandan Singh and W. James Murdoch and Bin Yu},
   booktitle={International Conference on Learning Representations},
   year={2019},
   url={https://openreview.net/forum?id=SkEqro0ctQ},
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

csinva / hierarchical-dnn-interpretations

Programming Languages

Labels

Projects that are alternatives of or similar to hierarchical-dnn-interpretations

Hierarchical neural-net interpretations (ACD) 🧠

examples/documentation

notes on using ACD on your own data

related work

reference