Pinned Repositories
interpretability-starter
π§ Starter templates for doing interpretability research
tdc2023-starter-kit
This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
ai-cyberdefense
π₯ A repository for collecting cyberdefense thoughts, books, and documents about AI cyberdefense
CogNet
πΈ A network study of social interaction during the covid-19 lockdown
DarkGPT
Dark Patterns in Chatbot Design
Emma
𧬠The Emma dataset for four-dimensional Danish sentiment analysis
esbenkc
π¨βπ¦° Personal profile README
fnirs-bci
π§ Inspecting complexity and goal-directedness of imagination in an fNIRS BCI system.
Sentida
Sentida
othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
esbenkc's Repositories
esbenkc/ai-cyberdefense
π₯ A repository for collecting cyberdefense thoughts, books, and documents about AI cyberdefense
esbenkc/fnirs-bci
π§ Inspecting complexity and goal-directedness of imagination in an fNIRS BCI system.
esbenkc/DarkGPT
Dark Patterns in Chatbot Design
esbenkc/CogNet
πΈ A network study of social interaction during the covid-19 lockdown
esbenkc/esbenkc
π¨βπ¦° Personal profile README
esbenkc/karnak
π Make sure AI applications are not injecting 1) suspicious API calls, 2) vulnerabilities, and 3) rogue capabilities
esbenkc/wikipod
π§ Fully automated Wikipedia audiobooks [WIP]
esbenkc/cyberwarfare
π€ Cyberwarfare vulnerabilities for democracy
esbenkc/tracr-mechint
esbenkc/anthrobench
π€π©βπ¦° Anthropomorphism Benchmark
esbenkc/benchmarks
π Benchmarking the safety of AI systems
esbenkc/democratic-inputs-to-ai
π€π Make AI values democratically guided
esbenkc/GGG
πΎ Global Game Jam Copenhagen Gamle GrenΓ₯ Gutter!
esbenkc/Guard-Llama
π¦ Testing the security of Llamas (specifically the third iteration, Llama 3)
esbenkc/multiagent-scaling
π For the Multi-Agent Safety Hackathon
esbenkc/pauseai
Website for PauseAI.info
esbenkc/robot-reviewer
π€ Helps you review papers by highlighting the areas relevant to your interests!
esbenkc/tdc2023-starter-kit
This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
esbenkc/verification-jam
π¨βπ¬ Repository for the verification jam
esbenkc/A-
Founders' Cooperatives: How to organize ownership around an important mission?
esbenkc/aigov
π Submission for the AI Gov Hack
esbenkc/aj5
β¨ Github repo for work during aj5
esbenkc/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
esbenkc/devevals
π Studying developmental evals on Pythia models
esbenkc/gdpt
π GPT but it's GDPR. No private data getting sent to your favorite AGI companies.
esbenkc/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
esbenkc/othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
esbenkc/personal-stats-website
π This website is to display my personal skill levels in RuneScape stats
esbenkc/ryebread
esbenkc/zapier
A Jupyter python notebook to Execute Zapier Tasks with GPT completion via Langchain