XAI

Providing insight, explanations, and interpretability to machine learning methods.

101 resources4 categoriesView Original

Quick Navigation

Follow(3 items)

Rich Caruana

The man behind Explainable Boosting Machines.

The Institute for Ethical AI & Machine Learning

A UK-based research center that performs research into ethical AI/ML, which frequently involves XAI.

Tim Miller

One of the preeminent researchers in XAI.

Papers(93 items)

Ada-SISE

Adaptive semantice inpute sampling for explanation.

Papers

ALE

Accumulated local effects plot.

Papers

ALIME

Autoencoder Based Approach for Local Interpretability.

Papers

Anchors

High-Precision Model-Agnostic Explanations.

Papers

Attention is not --not-- Explanation

This is a rebutal to the above paper. Authors argue that multiple explanations can be valid and that the and that attention can produce *a* valid explanation, if not -the- valid explanation.

Papers

Attention is not Explanation

Authors perform a series of NLP experiments which argue attention does not provide meaningful explanations. They also demosntrate that different attentions can generate similar model outputs.

Papers

Auditing

Auditing black-box models.

Papers

BayLIME

Bayesian local interpretable model-agnostic explanations.

Papers

Break Down

Break down plots for additive attributions.

Papers

CAM

Class activation mapping.

Papers

CDT

Confident interpretation of Bayesian decision tree ensembles.

Papers

CICE

Centered ICE plot.

Papers

CMM

Combined multiple models metalearner.

Papers

Conj Rules

Using sampling and queries to extract rules from trained neural networks.

Papers

CP

Contribution propogation.

Papers

Decision List

Like a decision tree with no branches.

Papers

Decision Trees

The tree provides an interpretation.

Papers

DecText

Extracting decision trees from trained neural networks.

Papers

DeepLIFT

Deep label-specific feature learning for image annotation.

Papers

Do Not Trust Additive Explanations

Authors argue that addditive explanations (e.g. LIME, SHAP, Break Down) fail to take feature ineractions into account and are thus unreliable.

Papers

DTD

Deep Taylor decomposition.

Papers

Explainable Boosting Machine

Method that predicts based on learned vector graphs of features.

Papers

Explainable Deep Learning: A Field Guide for th...

An in-depth description of XAI focused on technqiues for deep learning.

Papers

ExplainD

Explanations of evidence in additive classifiers.

Papers

Explanation in Artificial Intelligence: Insight...

This paper provides an introduction to the social science research into explanations. The author provides 4 major findings: (1) explanations are constrastive, (2) explanations are selected, (3) probabilities probably don't matter, (4) explanations are social. These fit into the general theme that explanations are -contextual-.

Papers

FIRM

Feature importance ranking measure.

Papers

Fong, et. al.

Meaninful perturbations model.

Papers

G-REX

Rule extraction using genetic algorithms.

Papers

Gibbons, et. al.

Explain random forest using decision tree.

Papers

GoldenEye

Exploring classifiers by randomization.

Papers

GPD

Gaussian process decisions.

Papers

GPDT

Genetic program to evolve decision trees.

Papers

GradCAM

Gradient-weighted Class Activation Mapping.

Papers

GradCAM++

Generalized gradient-based visual explanations.

Papers

Hara, et. al.

Making tree ensembles interpretable.

Papers

ICE

Individual conditional expectation plots.

Papers

IG

Integrated gradients.

Papers

inTrees

Interpreting tree ensembles with inTrees.

Papers

IOFP

Iterative orthoganol feature projection.

Papers

IP

Information plane visualization.

Papers

k-Nearest Neighbors

The prototypical clustering method.

Papers

KL-LIME

Kullback-Leibler Projections based LIME.

Papers

Krishnan, et. al.

Extracting decision trees from trained neural networks.

Papers

Lei, et. al.

Rationalizing neural predictions with generator and encoder.

Papers

LIME

Local Interpretable Model-Agnostic Explanations.

Papers

Linear Regression

Easily plottable and understandable regression.

Papers

LOCO

Leave-one covariate out.

Papers

Logistic Regression

Easily plottable and understandable classification.

Papers

LORE

Local rule-based explanations.

Papers

Lou, et. al.

Accurate intelligibile models with pairwise interactions.

Papers

LRP

Layer-wise relevance propogation.

Papers

MCR

Model class reliance.

Papers

MES

Model explanation system.

Papers

MFI

Feature importance measure for non-linear algorithms.

Papers

Naive Bayes

Good classification, poor estimation using conditional probabilities.

Papers

NID

Neural interpretation diagram.

Papers

OptiLIME

Optimized LIME.

Papers

PALM

Partition aware local model.

Papers

PDA

Prediction Difference Analysis: Visualize deep neural network decisions.

Papers

PDP

Partial dependence plots.

Papers

Please Stop Permuting Features An Explanation a...

Authors demonstrate why permuting features is misleading, especially where there is strong feature dependence. They offer several previously described alternatives.

Papers

POIMs

Positional oligomer importance matrices for understanding SVM signal detectors.

Papers

ProfWeight

Transfer information from deep network to simpler model.

Papers

Prospector

Interactive partial dependence diagnostics.

Papers

QII

Quantitative input influence.

Papers

Quantifying Explainability of Saliency Methods ...

An analysis of how different heatmap-based saliency methods perform based on experimentation with a generated dataset.

Papers

REFNE

Extracting symbolic rules from trained neural network ensembles.

Papers

RETAIN

Reverse time attention model.

Papers

RISE

Randomized input sampling for explanation.

Papers

RuleFit

Sparse linear model as decision rules including feature interactions.

Papers

RxREN

Reverse engineering neural networks for rule extraction.

Papers

Sanity Checks for Saliency Maps

An important read for anyone using saliency maps. This paper proposes two experiments to determine whether saliency maps are useful: (1) model parameter randomization test compares maps from trained and untrained models, (2) data randomization test compares maps from models trained on the original dataset and models trained on the same dataset with randomized labels. They find that "some widely deployed saliency methods are independent of both the data the model was trained on, and the model parameters".

Papers

SHAP

A unified approach to interpretting model predictions.

Papers

SIDU

Similarity, difference, and uniqueness input perturbation.

Papers

Simonynan, et. al

Visualizing CNN classes.

Papers

Singh, et. al

Programs as black-box explanations.

Papers

STA

Interpreting models via Single Tree Approximation.

Papers

Stop Explaining Black Box Machine Learning Mode...

Authors present a number of issues with explainable ML and challenges to interpretable ML: (1) constructing optimal logical models, (2) constructing optimal sparse scoring systems, (3) defining interpretability and creating methods for specific methods. They also offer an argument for why interpretable models might exist in many different domains.

Papers

Strumbelj, et. al.

Explanation of individual classifications using game theory.

Papers

SVM+P

Rule extraction from support vector machines.

Papers

TCAV

Testing with concept activation vectors.

Papers

The (Un)reliability of Saliency Methods

Authors demonstrate how saliency methods vary attribution when adding a constant shift to the input data. They argue that methods should fulfill *input invariance*, that a saliency method mirror the sensistivity of the model with respect to transformations of the input.

Papers