GitHub - INK-USC/DIG: Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021) [paper]

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.
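The anchor-selection rules described in the caption can be illustrated with a small, self-contained sketch. This is not the repository's implementation: the function names are hypothetical, the distance is assumed to be Euclidean, and the monotonization step is assumed to clamp each non-monotonic coordinate into the interval spanned by w and w'. The paper defines the actual procedure and may differ in these details.

```python
from typing import List, Sequence

Vector = Sequence[float]


def straight_line_path(w: Vector, w_prime: Vector, steps: int) -> List[List[float]]:
    """IG-style path: `steps` evenly spaced points strictly between w and w'."""
    return [
        [a + (k / (steps + 1)) * (b - a) for a, b in zip(w, w_prime)]
        for k in range(1, steps + 1)
    ]


def monotonic_dims(w: Vector, c: Vector, w_prime: Vector) -> List[bool]:
    """For each dimension, report whether c lies between w and w' (inclusive)."""
    return [min(a, b) <= x <= max(a, b) for a, x, b in zip(w, c, w_prime)]


def monotonize(w: Vector, c: Vector, w_prime: Vector) -> List[float]:
    """ASSUMPTION: clamp each coordinate of c into [min(w, w'), max(w, w')].

    Coordinates that are already monotonic are left unchanged.
    """
    return [min(max(x, min(a, b)), max(a, b)) for a, x, b in zip(w, c, w_prime)]


def distance(u: Vector, v: Vector) -> float:
    """Euclidean distance (an assumed choice of metric)."""
    return sum((x - y) ** 2 for x, y in zip(u, v)) ** 0.5


def pick_anchor_greedy(w: Vector, neighbors: List[Vector], w_prime: Vector) -> int:
    """Greedy rule: index of the neighbor closest to its own monotonized point."""
    return min(
        range(len(neighbors)),
        key=lambda i: distance(neighbors[i], monotonize(w, neighbors[i], w_prime)),
    )


def pick_anchor_maxcount(w: Vector, neighbors: List[Vector], w_prime: Vector) -> int:
    """MaxCount rule: index of the neighbor with the most monotonic dimensions."""
    return max(
        range(len(neighbors)),
        key=lambda i: sum(monotonic_dims(w, neighbors[i], w_prime)),
    )


# Toy 2-D example (made-up vectors, not real word embeddings).
w = [1.0, 1.0]          # word being attributed
w_prime = [0.0, 0.0]    # baseline
neighbors = [[0.5, 1.4], [1.1, 1.1], [1.2, 2.0]]

print(straight_line_path(w, w_prime, 3))
print(pick_anchor_greedy(w, neighbors, w_prime))    # 1
print(pick_anchor_maxcount(w, neighbors, w_prime))  # 0
```

In this toy example the two rules pick different anchors: the second neighbor needs the smallest correction to become monotonic, while the first neighbor already has the most monotonic dimensions.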

Dependencies

Install the dependencies listed in requirements.txt, for example with pip install -r requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Run ./setup.sh to set up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

The same commands can be adapted to other settings by changing the -dataset, -nn, and -strategy arguments.
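To run the three steps above in sequence, a small driver script along these lines may be convenient. It is a sketch rather than part of the repository: build_commands and run_all are hypothetical helper names, and the script uses only the flags and values shown in the commands above. It defaults to a dry run that prints the commands without executing them.

```python
import shlex
import subprocess
from typing import List, Sequence


def build_commands(
    dataset: str = "sst2",
    nn: str = "distilbert",
    strategies: Sequence[str] = ("greedy", "maxcount"),
) -> List[List[str]]:
    """Build the KNN-graph command followed by one DIG command per strategy."""
    commands = [["python", "knn.py", "-dataset", dataset, "-nn", nn]]
    for strategy in strategies:
        commands.append(
            ["python", "main.py", "-dataset", dataset, "-nn", nn, "-strategy", strategy]
        )
    return commands


def run_all(commands: List[List[str]], dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            # check=True stops the sequence if a step fails.
            subprocess.run(cmd, check=True)


# Dry run: only prints the commands.
run_all(build_commands())
```

Calling run_all(build_commands(), dry_run=False) from the repository root would execute the steps in order, assuming the dependencies are installed and ./setup.sh has been run.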

Citation

Please cite the following paper if you use the code in your work:

@inproceedings{sanyal-ren-2021-discretized,
    title = "Discretized Integrated Gradients for Explaining Language Models",
    author = "Sanyal, Soumya  and Ren, Xiang",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
    year = "2021",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.emnlp-main.805",
    pages = "10285--10299",
}

Please contact Soumya for any clarifications or suggestions.

