[Summary] Add perturbation feature attribution methods #107

gsarti · 2021-11-30T18:58:34Z

🚀 Feature Request

The following is a non-exhaustive list of perturbation-based feature attribution methods that could be added to the library:

| Method name | Source | In Captum | Code implementation | Status |
|---|---|---|---|---|
| (Layer) Feature Ablation¹ | - | ✅ | `pytorch/captum` | |
| Occlusion | Zeiler and Fergus '13 | ✅ | `pytorch/captum` | ✅ |
| Shapley Value Sampling | Castro et al. '09 | ✅ | `pytorch/captum` | |
| Lime | Ribeiro et al. '16 | ✅ | `pytorch/captum` | ✅ |
| KernelShap | Lundberg and Lee '17 | ✅ | `pytorch/captum` | |
| Editing² | - | - | - | |
| Greedy Rationalization³ | Vafa et al. '21 | - | `keyonvafa/sequential-rationales` | |
| Information Bottleneck | Jiang et al. '20 | - | `DFKI-NLP/thermostat` | |
| BayesLime | Slack et al. '21 | - | `dylan-slack/Modeling-Uncertainty-Local-Explainability` | |
| BayesSHAP | Slack et al. '21 | - | `dylan-slack/Modeling-Uncertainty-Local-Explainability` | |
| Input Reduction | Feng et al. '18 | - | - | |
| Input Marginalization | Kim et al. '20 | - | - | |
| Occlusion & Language Modeling | Harbecke and Alt '20 | - | `DFKI-NLP/OLM` | |
| Context Probing⁴ | Cífka and Liutkus '22 | - | `cifkao/context-probing` | |
| Weighted SHAP | Kwon and Zou '22 | - | `ykwon0407/WeightedSHAP` | |
| Value Zeroing | Mohebbi et al. '23 | - | `hmohebbi/ValueZeroing` | #173 |
| Comprehensiveness-as-a-metric | Zhou et al. '23 | - | `YilunZhou/solvability-explainer` | |
| Sufficiency-as-a-metric | Zhou et al. '23 | - | `YilunZhou/solvability-explainer` | |
| Causal Tracing | Meng et al. '22 | - | `kmeng01/rome` | |
| Attention Knockout⁵ | Geva et al. '23 | - | - | |
| ReAGent | Zhao et al. '24 | - | `casszhao/ReAGent` | #250 |
| SyntaxSHAP | Amara et al. '24 | - | `k-amara/syntax-shap` | |

Notes:

1. Called ablation, but performs masking of features using a baseline.
2. Editing replaces tokens with their nearest neighbors in the vocabulary embedding space and measures saliency as the drop in performance for the target. In the future, this could let users specify a custom editing strategy via an input Callable.
3. May overlap with feature ablation to some extent.
4. Valid only for decoder-only models.
5. Verify whether it is exactly equivalent to Value Zeroing; include it only if functionally different (alias otherwise).
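
To make note 2 more concrete, here is a minimal, self-contained sketch of what an Editing-style attribution could look like. It is not an existing API of this library or of Captum: the names `EMBEDDINGS`, `nearest_neighbor`, `editing_saliency` and `toy_score` are all hypothetical, the embedding values are made up, and `toy_score` merely stands in for a model's score on the target. The `edit_fn` argument illustrates the custom editing strategy passed as a Callable.

```python
import math
from typing import Callable, Dict, List, Sequence

# Toy embedding table with made-up values (assumption, for illustration only).
EMBEDDINGS: Dict[str, List[float]] = {
    "good": [1.0, 0.1],
    "great": [0.9, 0.2],
    "bad": [-1.0, 0.1],
    "movie": [0.0, 1.0],
    "film": [0.1, 0.9],
}


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def nearest_neighbor(token: str) -> str:
    """Default editing strategy: the closest *other* token in embedding space."""
    query = EMBEDDINGS[token]
    candidates = [t for t in EMBEDDINGS if t != token]
    return max(candidates, key=lambda t: cosine(query, EMBEDDINGS[t]))


def editing_saliency(
    tokens: Sequence[str],
    score_fn: Callable[[Sequence[str]], float],
    edit_fn: Callable[[str], str] = nearest_neighbor,
) -> List[float]:
    """Saliency of each position = drop in score after editing that token."""
    base = score_fn(tokens)
    saliency = []
    for i, token in enumerate(tokens):
        edited = list(tokens)
        edited[i] = edit_fn(token)  # replace one token at a time
        saliency.append(base - score_fn(edited))
    return saliency


def toy_score(tokens: Sequence[str]) -> float:
    """Hypothetical stand-in for the model's score on the target."""
    weights = {"good": 0.6, "great": 0.5, "bad": -0.6}
    return sum(weights.get(t, 0.0) for t in tokens)


if __name__ == "__main__":
    sentence = ["good", "movie"]
    # Default strategy: nearest neighbor in the toy embedding space.
    print(editing_saliency(sentence, toy_score))
    # Custom strategy supplied as a Callable: always substitute "bad".
    print(editing_saliency(sentence, toy_score, edit_fn=lambda t: "bad"))
```

In a real implementation the score function would presumably wrap a forward pass of the attributed model, and the neighbor search would run over the model's own embedding matrix rather than a hand-written dictionary.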


nfelnlp · 2022-10-10T14:44:14Z

More methods related to Occlusion:

gsarti · 2022-10-11T08:22:01Z

Added to method table!

gsarti added the enhancement label Nov 30, 2021

gsarti added this to the v1.0 milestone Nov 30, 2021

gsarti added the help wanted and good first issue labels Nov 30, 2021

inseq-team deleted a comment from github-actions bot Nov 30, 2021

gsarti added the summary label Dec 1, 2021

gsarti removed the good first issue label Apr 8, 2022

nfelnlp mentioned this issue Oct 24, 2022 in Add OcclusionAttribution and LimeAttribution #145 (merged)

gsarti pinned this issue Jan 24, 2023

gsarti removed this from the Demo Paper Release milestone May 8, 2023