# PyTorch Interpret

A simple-to-use PyTorch library for interpreting your deep learning results, using both visualisations and attributions. Inspired by TensorFlow Lucid.


## Installation

Install from PyPI:

```bash
pip install interpret-pytorch
```

Or, install the latest code from GitHub:

```bash
pip install git+https://github.com/ttumiel/interpret
```

### Dependencies

`interpret` requires a working installation of PyTorch.

## Contents

- Tutorials
- Visualisation
- Attribution
- Miscellaneous Interpretations

## Tutorials

Run the tutorials in the browser using Google Colab.

| Tutorial | Link |
| --- | --- |
| Introduction to interpret | Open In Colab |
| Visualisation Tutorial | Open In Colab |
| Miscellaneous Methods Tutorial | Open In Colab |

## Visualisation

Channel visualisations generated with interpret.

Visualisation is a technique that generates inputs that optimise a particular objective within a trained network. By using visualisations, we can understand what a network is looking for. For an in-depth explanation of visualisation, see Feature Visualisation.

### Quickstart

Generating a visualisation is done by loading a trained network, selecting the objective to optimise for, and running the optimisation. An example using a pretrained network from torchvision is shown below.

```python
from interpret import OptVis
import torchvision

# Get the PyTorch neural network
network = torchvision.models.vgg11(pretrained=True)

# Select a layer from the network. Use get_layer_names()
# to see a list of layer names and sizes.
layer = 'features/18'
channel = 12

# Create an OptVis object from a PyTorch model
optvis = OptVis.from_layer(network, layer=layer, channel=channel)

# Create the visualisation
optvis.vis()
```
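The comment above refers to `get_layer_names()` for listing the slash-separated layer names. Assuming it is importable from the top-level package (an assumption; check the package docs if it lives elsewhere), usage looks like:

```python
# Import path assumed; adjust if the package exports it elsewhere.
from interpret import get_layer_names

get_layer_names(network)  # lists layer names (e.g. 'features/18') and sizes
```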

### Parameterisations

Images can be parameterised in several different ways. As long as the parameterisation is differentiable, the input can be optimised for a particular layer. For code examples, see the Visualisation Tutorial Notebook.

The default parameterisation is in a spatially and colour-decorrelated space.

Decorrelated visualisations
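As a rough, minimal sketch of the spatial half of this idea (not the library's implementation; the library also decorrelates colour, typically via an empirical colour matrix), the image can be parameterised by a Fourier-space spectrum that is transformed to pixels on every forward pass:

```python
import torch

# Parameterise a spectrum instead of pixels. Gradients flow through the
# inverse FFT, so neighbouring pixels are no longer optimised independently.
h = w = 224
spectrum = (torch.randn(1, 3, h, w // 2 + 1, 2) * 0.01).requires_grad_(True)

def to_image(spec):
    complex_spec = torch.view_as_complex(spec)       # (1, 3, h, w//2+1)
    img = torch.fft.irfft2(complex_spec, s=(h, w))   # back to pixel space
    return torch.sigmoid(img)                        # squash into [0, 1]
```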

We can also parameterise in regular pixel space, but the visualisations tend to be worse.

Pixel space parameterised visualisations
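For comparison, a pixel-space parameterisation is just the raw pixel tensor (again a sketch of the idea, not the library's API):

```python
import torch

# Optimise raw pixels directly. Neighbouring pixels are updated
# independently, which is one reason these visualisations look noisier.
pixels = (torch.randn(1, 3, 224, 224) * 0.01).requires_grad_(True)
image = torch.sigmoid(pixels)  # keep values in [0, 1]
```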

Another parameterisation is a compositional pattern-producing network (CPPN), which can generate infinite-resolution images that have the effect of "light paintings."

CPPN visualisations
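A CPPN maps (x, y) coordinates to RGB values with a small network, so the same weights can be rendered at any resolution. A minimal illustrative version (not the library's implementation):

```python
import torch
import torch.nn as nn

class CPPN(nn.Module):
    """Map (x, y) coordinates to RGB, so the image has no fixed resolution."""
    def __init__(self, hidden=24, depth=8):
        super().__init__()
        layers = [nn.Linear(2, hidden), nn.Tanh()]
        for _ in range(depth - 1):
            layers += [nn.Linear(hidden, hidden), nn.Tanh()]
        layers += [nn.Linear(hidden, 3), nn.Sigmoid()]
        self.net = nn.Sequential(*layers)

    def forward(self, size):
        coords = torch.linspace(-1, 1, size)
        ys, xs = torch.meshgrid(coords, coords, indexing='ij')
        grid = torch.stack([xs, ys], dim=-1).reshape(-1, 2)
        rgb = self.net(grid)                                       # (size*size, 3)
        return rgb.reshape(size, size, 3).permute(2, 0, 1)[None]  # (1, 3, H, W)

image = CPPN()(224)  # render the same weights at any size
```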

### Objectives

The objective on which to optimise can also be manipulated to create different visualisations. We can add objectives together to get compound objectives or negate them to get negative neurons. See the Visualisation Tutorial Notebook for examples.

#### Layer Objective

A LayerObjective can be created easily using the from_layer class method of OptVis. Here we can choose the layer, channel, and neuron to optimise for. For example, we can optimise for a particular neuron:

Neuron visualisations
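In code this reuses from_layer; the neuron argument is described above, though the exact indexing convention is worth checking against the tutorial notebook:

```python
# Optimise for a single neuron within channel 12 of 'features/18'.
# The neuron indexing convention here is an assumption.
optvis = OptVis.from_layer(network, layer='features/18', channel=12, neuron=0)
optvis.vis()
```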

We can also manually create two objectives and add them together to get a compound objective:

Compound activations between more than one objective
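A hypothetical sketch of that pattern (the LayerObjective import path and constructor arguments are assumptions; see the Visualisation Tutorial Notebook for the exact API):

```python
from interpret import OptVis, LayerObjective  # import path assumed

# Add two objectives together to optimise for both channels at once.
obj = LayerObjective('features/18', channel=10) + LayerObjective('features/18', channel=20)
optvis = OptVis(network, obj)  # constructing OptVis from an objective is assumed
optvis.vis()
```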

Or we can find the negated objective that minimises a particular neuron:

Negative neurons minimise a particular activation
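Continuing the same hypothetical sketch, negation flips the sign of the objective:

```python
# Minimise (rather than maximise) the channel's activation.
optvis = OptVis(network, -LayerObjective('features/18', channel=12))
optvis.vis()
```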

Layer objectives are fairly flexible: you can select any layer in the network and capture its output. By visualising the last layer of the network, we can generate class visualisations of the different classes in ImageNet.

Class Visualisations
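For instance, optimising the logit of one ImageNet class at the final layer of the vgg11 network from the quickstart (the layer name 'classifier/6' is an assumption based on the naming scheme above; confirm it with get_layer_names()):

```python
# Class visualisation: maximise the output neuron for one ImageNet class.
optvis = OptVis.from_layer(network, layer='classifier/6', neuron=207)
optvis.vis()
```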

#### Deep Dream Objective

The deep dream objective optimises for "interestingness" across an entire layer. We can create this objective from an input image and select a layer using the from_dream class method.

Deep dream objective visualisations
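A sketch of that call (the argument names and preprocessing are assumptions; see the tutorial notebook for the exact signature):

```python
from PIL import Image
from interpret import OptVis, norm

# Build a deep dream objective from an image and a layer (argument
# names assumed), then run the optimisation as before.
input_data = norm(Image.open('image.jpg'))
optvis = OptVis.from_dream(network, layer='features/20', img=input_data)
optvis.vis()
```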


## Attribution

Attribution methods show where a neural network is looking when it makes a certain prediction.

Attribution is performed by feeding a particular input into the trained network and generating a saliency map that shows the parts of the image that the network responds to most strongly.

### Quickstart

```python
from interpret import Gradcam, norm
from PIL import Image
import torchvision

network = torchvision.models.vgg11(pretrained=True)
input_img = Image.open('image.jpg')

# Normalise the input image and turn it into a tensor
input_data = norm(input_img)

# Select the class that we are attributing to
class_number = 207

# Choose a layer for Grad-CAM
layer = 'features/20'

# Generate a Grad-CAM attribution map
saliency_map = Gradcam(network, input_data, im_class=class_number, layer=layer)
saliency_map.show()
```

## Miscellaneous Interpretations

Included in interpret are a few additional interpretation methods that don't fit neatly into either visualisation or attribution.

### Top Losses

Plot the inputs that produce the largest loss. This is useful for identifying where the network is most unsure, or where an input doesn't actually fit its label (a mislabelled image). You can also enable a Grad-CAM attribution overlay for each image to see where the network is looking.

Top losses plotted with Grad-CAM attribution overlay.
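The underlying computation is simple; a plain-PyTorch sketch of the idea (not the library's API):

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def top_losses(network, loader, k=9):
    """Return the k dataset examples with the largest cross-entropy loss."""
    network.eval()
    losses, inputs, labels = [], [], []
    for x, y in loader:
        losses.append(F.cross_entropy(network(x), y, reduction='none'))
        inputs.append(x)
        labels.append(y)
    losses = torch.cat(losses)
    worst = losses.topk(k).indices
    return torch.cat(inputs)[worst], torch.cat(labels)[worst], losses[worst]
```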

### Confusion Matrix

Plot a confusion matrix for a multi-class classification or binned regression objective.

Confusion matrix on 10 classes
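A plain-PyTorch sketch of the computation behind the plot (the library handles the plotting itself):

```python
import torch

@torch.no_grad()
def confusion_matrix(network, loader, n_classes):
    """Count predictions per true class: rows are targets, columns predictions."""
    network.eval()
    cm = torch.zeros(n_classes, n_classes, dtype=torch.long)
    for x, y in loader:
        preds = network(x).argmax(dim=1)
        for t, p in zip(y, preds):
            cm[t, p] += 1
    return cm
```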

### Dataset Examples

Plot dataset examples that maximise a particular LayerObjective from the visualisation objectives described above. This is useful for grounding a visualisation in real inputs that the network responds to in the same way.

Comparison between a layer visualisation and dataset examples that also activate the same layer.
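One way to implement the ranking with plain PyTorch (a sketch, not the library's implementation) is to capture the layer's output with a forward hook and sort the dataset by a channel's mean activation:

```python
import torch

@torch.no_grad()
def dataset_examples(network, layer_module, loader, channel, k=9):
    """Return the k images that most activate `channel` of `layer_module`."""
    acts, images = [], []

    def hook(module, inputs, output):
        acts.append(output[:, channel].mean(dim=(1, 2)))  # mean over H, W

    handle = layer_module.register_forward_hook(hook)
    network.eval()
    for x, _ in loader:
        network(x)
        images.append(x)
    handle.remove()
    scores = torch.cat(acts)
    return torch.cat(images)[scores.topk(k).indices]

# e.g. layer_module = network.features[18] for the vgg11 layer used above
```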

### Loss Landscape

Plot the loss landscape in two directions around a trained network's optimum, as either a surface plot or a contour plot. For details, see Li et al. Below we see how adding shortcut connections to a ResNet-56 makes the loss landscape much smoother. Note the difference in scale.

Comparison of loss landscape between a ResNet56 with shortcut connections and without.
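A sketch of the procedure from Li et al. in plain PyTorch (not the library's API; directions are normalised per parameter tensor here rather than per filter, for brevity):

```python
import torch

@torch.no_grad()
def loss_surface(network, loss_fn, batch, steps=21, span=1.0):
    """Evaluate the loss on a grid spanned by two random weight directions."""
    x, y = batch
    network.eval()
    params = list(network.parameters())
    originals = [p.clone() for p in params]

    # Two random directions, rescaled to match each parameter's norm.
    directions = []
    for _ in range(2):
        d = [torch.randn_like(p) for p in params]
        directions.append([di * p.norm() / (di.norm() + 1e-10)
                           for di, p in zip(d, params)])

    alphas = torch.linspace(-span, span, steps)
    surface = torch.zeros(steps, steps)
    for i, a in enumerate(alphas):
        for j, b in enumerate(alphas):
            for p, o, d1, d2 in zip(params, originals, *directions):
                p.copy_(o + a * d1 + b * d2)
            surface[i, j] = loss_fn(network(x), y)

    for p, o in zip(params, originals):  # restore the trained weights
        p.copy_(o)
    return surface
```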