ChloeL19's Stars
ai-safety-foundation/sparse_autoencoder
Sparse Autoencoder for Mechanistic Interpretability
ChuyueSun/Clover
Clover: Closed-Loop Verifiable Code Generation
sun-wendy/DafnyBench
DafnyBench: A Benchmark for Formal Software Verification
CLARKBENHAM/circuit-breakers
Improving Alignment and Robustness with Circuit Breakers