nli0
ML evaluations and safety @scaleapi @centerforaisafety, CS @ucberkeley. nli0.github.io
@ucberkeley · San Francisco, CA
nli0's Stars
jdholtz/auto-southwest-check-in
A Python script that automatically checks in to your Southwest flight 24 hours beforehand.
scaleapi/browser-art
huggingface/evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
baceolus/BioLP-bench
A benchmark for evaluating AI models' ability to understand biological lab protocols
magic-wormhole/magic-wormhole
Get things from one computer to another, safely
rohan-paul/LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) Fine-Tuning
UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
GraySwanAI/circuit-breakers
Improving Alignment and Robustness with Circuit Breakers
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
centerforaisafety/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
centerforaisafety/wmdp
WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method that reduces LLM performance on WMDP while retaining general capabilities.
fferflo/einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
aypan17/machiavelli
centerforaisafety/Intro_to_ML_Safety