BUTTER-Clarifier

This repository contains a Python package of neural network interpretability techniques (interpretability) and a Keras callback to easily compute and capture data related to these techniques (we call these values metrics) during training. It was developed for use with the BUTTER Deep Learning Experimental Framework, but it does not depend on that framework and may be useful to projects outside of it.

The vision for this codebase is to collect algorithms for explainable artificial intelligence (XAI) in a single framework that is easy to use, easy to read, and easy to expand upon. Here, we package XAI algorithms into a module called "metrics", where each metric is implemented as a Python function. The return type of a metric is typically a dictionary holding data of multiple types, such as real values and numpy matrices. Callbacks and other connector code are provided in a separate module to make these metrics easier to use. This project depends on TensorFlow's Keras API, although supporting multiple backends would be a welcome future improvement.
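
As an illustration of this design, the sketch below shows the general shape of a metric written in this style. The weight_norms function is hypothetical and is not part of the package; it simply returns a dictionary keyed by layer name, in the spirit described above.

import numpy as np

def weight_norms(model):
    # Hypothetical example metric: L2 norm of each layer's kernel weights.
    # Returns a dictionary mapping layer names to real values.
    results = {}
    for layer in model.layers:
        weights = layer.get_weights()
        if weights:  # skip layers without trainable weights
            results[layer.name] = float(np.linalg.norm(weights[0]))
    return results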

Installation

Dependencies

  • Developed using Python 3.11 with pip on Linux.
  • TensorFlow 2.13
  • Scikit-Learn

VSCode Devcontainer

This package was developed using VSCode Dev Containers and contains VSCode configuration files. The easiest way to get started developing this package is to clone the project from GitHub in VSCode and open it with the VSCode Dev Containers extension.

Manual Install using Pip

Manual installation with pip is also straightforward. To clone and install this project:

$ git clone [URL TO THIS REPO]
$ cd ./BUTTER-Clarifier
$ pip install -e ".[test]"

Testing

Pytest is installed automatically when the package is installed with the [test] extra, but you can install it manually with pip install pytest if needed. Tests can be run with the following command:

$ pytest ./test

Usage

Using the Keras callback

For training with Keras, a convenient Keras callback class is provided. The callback will compute the explainability metrics during model training and can store the results either in memory or as JSON files. The signature of its constructor is:

InterpretabilityMetricsKerasCallback(epochs, data, metrics="all", save_to_path=False, save_to_history=True)

  • epochs: Compute and record the metrics whenever the current epoch number is a multiple of this value.
  • data: A 4-tuple of data, (X_test, X_train, y_test, y_train).
  • metrics: "all" is the only supported option for now, and is the default.
  • save_to_path: If string, save the metrics to this path as JSON files.
  • save_to_history: If true, save the metrics in memory using the callback's history property.

Example of how to use the callback during training:

from interpretability.callbacks import InterpretabilityMetricsKerasCallback

X_test, X_train, y_test, y_train = ...
model = ...

interpretability_callback = InterpretabilityMetricsKerasCallback(20, (X_train, X_test, y_train, y_test))

model.fit(
    ...
    callbacks=[
        interpretability_callback
    ]
)

print(interpretability_callback.history)
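
The callback can also write results to disk via save_to_path. The sketch below is illustrative only: it builds a small synthetic regression model, follows the data ordering used in the example above, computes metrics every 10 epochs, and writes them to a hypothetical ./interpretability_output directory; the JSON file naming under that path is determined by the package.

import numpy as np
from tensorflow import keras
from interpretability.callbacks import InterpretabilityMetricsKerasCallback

# Synthetic regression data, purely for illustration.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 8)).astype("float32")
y = X.sum(axis=1, keepdims=True)
X_train, X_test = X[:800], X[800:]
y_train, y_test = y[:800], y[800:]

model = keras.Sequential([
    keras.layers.Dense(16, activation="relu", input_shape=(8,)),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# Compute metrics every 10 epochs; write JSON files and keep results in memory.
interpretability_callback = InterpretabilityMetricsKerasCallback(
    10,
    (X_train, X_test, y_train, y_test),
    save_to_path="./interpretability_output",  # hypothetical output directory
    save_to_history=True,
)

model.fit(X_train, y_train, epochs=50, callbacks=[interpretability_callback], verbose=0)
print(interpretability_callback.history)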

Using the metrics module

Instead of using the Keras callback, you can also call the metric functions directly. Each set of metrics is implemented as a Python function. The functions have similar signatures, with the model as the first argument, but some metrics require extra inputs and therefore do not have identical signatures.

Example:

from interpretability.metrics import basic_statistics, linear_probes

X_test, X_train, y_test, y_train = ...
model = ...

# Simplest metric
out = basic_statistics(model)
print(out)

# Some metrics require data
out = linear_probes(model, X_test, y_test)
print(out)
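
The exact keys in the returned dictionary vary from metric to metric, so a simple way to explore a result is to inspect what came back; the snippet below makes no assumptions about particular keys.

# Values may be scalars, numpy arrays, or other types depending on the metric.
for key, value in out.items():
    print(key, type(value))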

List of Metrics

The following metrics are implemented. For more information on these metrics, including academic citations, please see the inline documentation.

  • basic_statistics: Package of basic statistics of the weights and biases. Outputs: means of the weights and biases.
  • sparsity: Package of metrics related to sparsity. Outputs: number of non-zero weights and biases.
  • layerwise_pca: Computes a principal component analysis (PCA) of the activations of each layer. Outputs: explained variance. Inspired by references 1 and 2.
  • linear_probes: Computes a linear regression of the activations of each layer. Outputs: MSE and MAE error for each layer. Reference 3.
  • linear_cka: Linear Centered Kernel Alignment. Outputs: correlation matrix over the number of hidden layers. Reference 1.
  • cca: Canonical Correlation Analysis. Outputs: correlation matrix over the number of hidden layers. Reference 1.
  • split_relus: Computes the hidden units that are split by the given input data. Outputs: boolean mask over the hidden units, as well as the total count and ratio of split units. Inspired by reference 4.

Repository Contents

  • /interpretability: Python package containing the metrics and the Keras callback.
    • callbacks.py: Classes conforming to the Keras callback API.
    • metrics.py: Functions that implement the explainability metrics.
  • /test: Python tests that can be run from the command line or as part of a CI pipeline.

No Expectation of Support

This software is released without the expectation of support. If you run into issues using this software, you may report them using the GitHub issue tracker. However, the developers do not make any guarantees about responding to these issues.

Contributing and Roadmap

Some places where this codebase could be improved include:

  • Adding more metrics.
  • Adding more comprehensive tests.
  • Expanding metrics to different neural network architectures.
  • Adding built-in visualization tools.
  • Improving the documentation and generating a documentation website.

Contributions are welcome, but we cannot guarantee responsiveness around pull requests. We recommend first creating an issue in the GitHub issue tracker to alert us of your intent to contribute. Then, fork this repository into your own account and make your changes in your fork. Finally, submit a pull request from your fork into our develop branch, referencing the issue.

Similar Projects

  • Captum for PyTorch provides a collection of over thirty different model interpretability methods and is under active development.

  • Lucid for TensorFlow. The last commit was in 2021, and the project was archived by the maintainer in 2024.

  • ELI5 provides a unified interpretability API for multiple machine learning frameworks, including Scikit-Learn, Keras, and XGBoost. It appears to be inactive since 2022.

  • TF-Explain for TensorFlow provides a callback architecture for training and seven built-in interpretability methods. This library appears to be inactive since 2022.

Acknowledgements

This software was written by Jordan Perr-Sauer as part of the National Renewable Energy Laboratory's Laboratory Directed Research and Development program. It is released under software record SWR-23-61, "DMP Interpretability".

References

  1. Kornblith, S., Norouzi, M., Lee, H., & Hinton, G. (2019). Similarity of Neural Network Representations Revisited. Proceedings of the 36th International Conference on Machine Learning, Proceedings of Machine Learning Research 97:3519-3529. Available from https://proceedings.mlr.press/v97/kornblith19a.html.

  2. Raghu, M., Gilmer, J., Yosinski, J., & Sohl-Dickstein, J.N. (2017). SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability. Neural Information Processing Systems.

  3. Alain, G., & Bengio, Y. (2016). Understanding intermediate layers using linear classifier probes. ArXiv, abs/1610.01644.

  4. Bak, S. (2021). nnenum: Verification of ReLU Neural Networks with Optimized Abstraction Refinement. In: Dutle, A., Moscato, M.M., Titolo, L., Muñoz, C.A., Perez, I. (eds) NASA Formal Methods. NFM 2021. Lecture Notes in Computer Science, vol. 12673. Springer, Cham. https://doi.org/10.1007/978-3-030-76384-8_2
