ed1d1a8d's Stars
xai-org/grok-1
Grok open release
openai/openai-python
The official Python library for the OpenAI API
pqrs-org/Karabiner-Elements
Karabiner-Elements is a powerful tool for customizing keyboards on macOS
KindXiaoming/pykan
Kolmogorov Arnold Networks
overleaf/overleaf
A web-based collaborative LaTeX editor
astral-sh/rye
A Hassle-Free Python Experience
PaulJuliusMartinez/jless
jless is a command-line JSON viewer designed for reading, exploring, and searching through JSON data.
chaifeng/ufw-docker
To fix the Docker and UFW security flaw without disabling iptables
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
AlignmentResearch/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
lebrice/SimpleParsing
Simple, Elegant, Typed Argument Parsing with argparse
openphilanthropy/unrestricted-adversarial-examples
Contest Proposal and infrastructure for the Unrestricted Adversarial Examples Challenge
sony/ctm
justinchiu/openlogprobs
Extract full next-token probabilities via language model APIs
ArthurConmy/Automatic-Circuit-Discovery
GraySwanAI/circuit-breakers
Improving Alignment and Robustness with Circuit Breakers
wzekai99/DM-Improves-AT
Code for the paper "Better Diffusion Models Further Improve Adversarial Training" (ICML 2023)
ethz-spylab/rlhf_trojan_competition
Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.
MadryLab/datamodels-data
Data for "Datamodels: Predicting Predictions with Training Data"
anthropics/sleeper-agents-paper
Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".
cgarciae/einop
max-andr/adversarial-random-search-gpt4
Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023]
thestephencasper/everything-you-need
we got you bro
99991/cifar10-fast-simple
Train CIFAR10 to 94% accuracy in a few minutes/seconds. Based on https://github.com/davidcpage/cifar10-fast
ml-postech/robust-evaluation-of-diffusion-based-purification
[ICCV 2023 Oral] Official implementation of "Robust Evaluation of Diffusion-Based Adversarial Purification"
jplhughes/evals_template
Template for any evals project using LLM APIs
Shavit-Lab/Sparse-Expansion
Code for the paper "Sparse Expansion and Neuronal Disentanglement."
xuwangyin/AT-EBMs
GilgameshxZero/xena
Android SVG editor optimized for e-ink note-taking tablets, such as the Onyx Boox series.