aengusl

have a beer

Pinned Repositories

CT4Recognition
Language: Python | 0 0 00
DomainBed-Spawrious
DomainBed is a suite to test domain generalization algorithms
Language: Python | 0 0 00
evals_manip
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language: Python | 1 0 00
generate_spawrious
Language: Python | 0 1 00
HarmBench-goose
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Language: Jupyter Notebook | 0 0 00
latent-adversarial-training
Language: Jupyter Notebook | 30 1 111
machiavelli_manip
Language: Python | 1 0 00
manipulation-chatarena
Fork of chatarena: add examples that help to study the manipulation capabilities of LLMs
Language: Python | 2 0 00
spawrious
Language: Python | 25 3 03
tasks-goose
My own set of general evaluations to be shared between projects
Language: Jupyter Notebook | 1 0 00

aengusl/latent-adversarial-training
Language: Jupyter Notebook | 30 1 111
aengusl/spawrious
Language: Python | 25 3 03
aengusl/manipulation-chatarena
Fork of chatarena: add examples that help to study the manipulation capabilities of LLMs
Language: Python | 2 0 00
aengusl/evals_manip
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language: Python | 1 0 00
aengusl/machiavelli_manip
Language: Python | 1 0 00
aengusl/tasks-goose
My own set of general evaluations to be shared between projects
Language: Jupyter Notebook | 1 0 00
aengusl/CT4Recognition
Language: Python | 0 0 00
aengusl/DomainBed-Spawrious
DomainBed is a suite to test domain generalization algorithms
Language: Python | 0 0 00
aengusl/generate_spawrious
Language: Python | 0 1 00
aengusl/HarmBench-goose
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Language: Jupyter Notebook | 0 0 00
aengusl/langchain-js-chatbot
GPT4 & LangChain Chatbot for large PDF docs
Language: TypeScript | 00
aengusl/multimodal-eng
Language: Python | 1 0