Pinned Repositories
speedscope
🔬 A fast, interactive web-based viewer for performance profiles.
sae-rm
Using SAE's to interpret Reward Models (RM)
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
pyreft
ReFT: Representation Finetuning for Language Models
pyvene
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
PinetreePantry's Repositories
PinetreePantry/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.