[P2] Support argument name based intervention #11

frankaging · 2023-10-25T20:50:08Z

Description:
When using the hook, we can now support kwargs-based inputs by reading the input as a dictionary. However, we will always assume the dictionary only contains a single input (e.g., hidden representations). This assumption can easily go wrong. What should we do instead is to specify which part of the inputs we do interventions on in the config for the model.

Note that this will still result in coupled code with the Transformers library. Multiple PRs are required to move towards this direction.

frankaging self-assigned this Oct 25, 2023

frankaging changed the title ~~[P1] support argument name based intervention~~ [P1] Support argument name based intervention Nov 24, 2023

frankaging added enhancement New feature or request and removed enhancement New feature or request labels Nov 30, 2023

frankaging removed their assignment Dec 21, 2023

frankaging changed the title ~~[P1] Support argument name based intervention~~ [P2] Support argument name based intervention Jan 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[P2] Support argument name based intervention #11

[P2] Support argument name based intervention #11

frankaging commented Oct 25, 2023

[P2] Support argument name based intervention #11

[P2] Support argument name based intervention #11

Comments

frankaging commented Oct 25, 2023