v0.1.3: Infrastructure updates and bug updates
What's Changed
- [Bug-Fix] fix attention head intervention for multiple models by @Bakser in #159
- [P2] Add Sparse Autoencoder Interventions by @explanare in #164
- [P0] Revert back the kwargs argument for intervention init by @frankaging in #166
- [P0] First infrastructure change to support NDIF backend by @frankaging in #167
- [P2] Save/load trainable params in
IntervenableBase
methods by @aryamanarora in #153 - [P1] Start docs website by @aryamanarora in #169
- device of unit_locations should follow tensor_input by @aryopg in #171
- [Minor] Fix use_cache flag propagation by @frankaging in #172
- [Minor] Fix the base model use_cache arg passing by @frankaging in #173
- [Minor] Update
use_cache
intervenable_base.py by @frankaging in #175 - [Minor] Start to support generic intervention output, and adaptor-like tuning by @frankaging in #177
- add support for dict outputs (select the first value) by @leogagnon in #178
- [Minor] Update the huggingface-hub version by @frankaging in #179
New Contributors
- @Bakser made their first contribution in #159
- @explanare made their first contribution in #164
- @aryopg made their first contribution in #171
- @leogagnon made their first contribution in #178
Full Changelog: v0.1.2...v0.1.3