Pinned Repositories
rlhf_trojan_competition
Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.
agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
anthropic-tokenizer
Approximation of the Claude 3 tokenizer by inspecting generation stream
disasters-wikipedia-floods
javirandor
lm-evaluation-harness
A framework for few-shot evaluation of language models.
online-tutoring-analysis
passgpt
wdr
javirandor's Repositories
javirandor/anthropic-tokenizer
Approximation of the Claude 3 tokenizer by inspecting generation stream
javirandor/passgpt
javirandor/wdr
javirandor/disasters-wikipedia-floods
javirandor/agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
javirandor/javirandor
javirandor/lm-evaluation-harness
A framework for few-shot evaluation of language models.
javirandor/online-tutoring-analysis
javirandor/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs