evhub
Alignment Stress-Testing Team Lead @anthropics. Previously: @machine-intelligence, @openai, @google, @Yelp, @ripple.
Anthropic
San Francisco, California
evhub's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
tailscale/tailscale
The easiest, most secure way to use WireGuard and 2FA.
smol-ai/developer
the first library to let you embed a developer agent in your own app!
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
jupyter-widgets/ipywidgets
Interactive Widgets for the Jupyter Notebook
huggingface/safetensors
Simple, safe way to store and distribute tensors
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
python-lsp/python-lsp-server
Fork of the python-language-server project, maintained by the Spyder IDE team and the community
anthropics/anthropic-sdk-python
microsoft/vscode-jupyter
VS Code Jupyter extension
jupyter-xeus/xeus-python
Jupyter kernel for the Python programming language
composable-models/llm_multiagent_debate
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
google/pyink
Pyink, pronounced pī-ˈiŋk, is a Python formatter, forked from Black with a few different formatting behaviors.
callummcdougall/ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
Giskard-AI/awesome-ai-safety
📚 A curated list of papers & technical articles on AI Quality & Safety
Thrandis/EKFAC-pytorch
Repository containing PyTorch code for EKFAC and K-FAC preconditioners.
google/sycophancy-intervention
Scripts for generating synthetic finetuning data for reducing sycophancy.
python-trio/async_generator
Making it easy to write async iterators in Python 3.5
microsoft/vscode-jupyter-powertoys
PowerToys for Jupyter notebooks in VS Code
groove-x/trio-util
Utility library for the Python Trio async/await framework
socketteer/worldspider
gpt completions in vscode
klausweiss/typing-protocol-intersection
Protocols intersection for mypy
Silvenga/GnuWin32-Installer
A single MSI installer for the binaries created by the GnuWin32 Project.
nrimsky/InfluenceFunctions
Implementation of Influence Function approximations for differently sized ML models, using PyTorch
MatteoH2O1999/setup-python
Set up your GitHub Actions workflow with a specific version of Python including deprecated ones.
thestephencasper/mechanistic_interpretability_challenge
FabienRoger/Learning-From-Negative-Examples
UlisseMini/gpt-chess
gpt-3.5-turbo-instruct playing chess at 1800 Elo! https://lichess.org/@/konaz
FabienRoger/Password-Locked-LLM