esbenkc's Stars
SWE-agent/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
rougier/scientific-visualization-book
An open-access book on scientific visualization using Python and Matplotlib
GreyDGL/PentestGPT
A GPT-empowered penetration testing tool
smol-ai/GodMode
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
EasyJailbreak/EasyJailbreak
An easy-to-use Python framework to generate adversarial jailbreak prompts.
jzhang38/LongMamba
Some preliminary explorations of Mamba's context scaling.
rgreenblatt/arc_draw_more_samples_pub
Draw more samples
METR/task-standard
METR Task Standard
centerforaisafety/wmdp
WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
METR/public-tasks
andyzorigin/cybench
schroederdewitt/perfectly-secure-steganography
Contains open source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"
thestephencasper/everything-you-need
we got you bro
callummcdougall/sae_visualizer
Mindgard/cli
Test your AI model's security through CLI
noemaresearch/pinboard
Pin files for contextual, codebase-level AI assistance.
camtice/SandbagDetect
nlpet/democracy-ai-hackathon
This repository contains code for the Democracy x AI Hackathon by Apart Research
apartresearch/evaluations-starter
How to get started in evaluations and demonstrations research for dangerous capabilities
apartresearch/Research-Augmentation-Hackbook
apartresearch/3cb
3cb: Catastrophic Cyber Capabilities Benchmarking of Large Language Models
matthewjlutz/MASec-info-flow
Python code for "Fishing for the answer: Mapping the flow of information in LLM agent groups using lessons from fish schools" submitted to Apart Research Multi-Agent Security Hackathon 2024.
esbenkc/karnak
🔐 Make sure AI applications are not injecting 1) suspicious API calls, 2) vulnerabilities, and 3) rogue capabilities
esbenkc/cyberwarfare
🤝 Cyberwarfare vulnerabilities for democracy
Lovkush-A/inspect_multiturn_dialogue
apartresearch/hackathon-utils
😎 Code to run hackathons efficiently
apartresearch/task-standard
🚨 METR Task Standard fork for the Code Red Hackathon
lennart-finke/picturebooks
Which objects are visible through the holes in a picture book? This visual task is easy for adults, doable for primary schoolers, but hard for vision transformers.
simonwisdom/public-comment-generator
timothee-chauvin/PrimeVul-assets-on-github
PrimeVul with the assets under version control on GitHub, not on Google Drive