Pinned Repositories
CT4Recognition
DomainBed-Spawrious
DomainBed is a suite to test domain generalization algorithms
evals_manip
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
generate_spawrious
HarmBench-goose
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
latent-adversarial-training
machiavelli_manip
manipulation-chatarena
Fork of chatarena: add examples that help to study the manipulation capabilities of LLMs
spawrious
tasks-goose
My own set of general evaluations to be shared between projects
aengusl's Repositories
aengusl/latent-adversarial-training
aengusl/spawrious
aengusl/manipulation-chatarena
Fork of chatarena: add examples that help to study the manipulation capabilities of LLMs
aengusl/evals_manip
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
aengusl/machiavelli_manip
aengusl/tasks-goose
My own set of general evaluations to be shared between projects
aengusl/CT4Recognition
aengusl/DomainBed-Spawrious
DomainBed is a suite to test domain generalization algorithms
aengusl/generate_spawrious
aengusl/HarmBench-goose
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
aengusl/langchain-js-chatbot
GPT4 & LangChain Chatbot for large PDF docs
aengusl/multimodal-eng