pyvene.models.interventions#
Classes
|
Intervention the original representations with activation addition. |
|
Intervene in the latent space of an autoencoder. |
|
Intervention that will modify its basis in a uncontrolled manner. |
|
Intervention in the rotated space with boundary mask. |
|
Collect activations. |
|
Constant source. |
|
Distributed representation. |
|
Intervention the original representations. |
|
Output of the IntervenableModel, including original outputs, intervened outputs, and collected activations. |
|
Interchange intervention on JumpReLU SAE's latent subspaces |
|
Localist representation. |
|
Intervention in the rotated space. |
|
Noise intervention |
|
Intervention in the pca space. |
|
Intervention in the rotated space. |
|
Intervention the original representations. |
|
Intervention in the original basis with binary mask. |
|
Intervention in the rotated space with boundary mask. |
|
Skip the current intervening layer's computation in the hook function. |
|
No source. |
|
Intervention the original representations with activation subtraction. |
|
Intervention the original representations. |
|
Intervention the original representations. |
|
Zero-out activations. |