/dialz

The official repo for the Dialz Python library - a toolkit for steering vector research.

Primary LanguageJupyter NotebookMIT LicenseMIT

PyPI license

Dialz: A Python Toolkit for Steering Vectors

Docs: https://cardiffnlp.github.io/dialz/

Paper: https://www.arxiv.org/abs/2505.06262

About

Steering vectors allow users to modify activations at inference time to amplify or weaken a 'concept', e.g. honesty or positivity.

Dialz supports a diverse set of tasks, including creating contrastive pair datasets, computing and applying steering vectors, and visualizations.

A basic tutorial can be found here.

Installation

pip install dialz

Check out the full documentation for usage information.

Contributing

Any contributions to improve this project are welcome! Please open an issue or pull request in this repo with any changes you have.

License

This code is released under a MIT license.