apartresearch
Artificial intelligence will change the world. Our mission is to ensure this happens safely and to the benefit of everyone.
Pinned Repositories
ai-psychology-starter
Code templates to get started as an AI psychologist
aisafetyideas
💡 The web app CI/CD for aisafetyideas.com
deepdecipher
🦠 DeepDecipher: An open source API to MLP neurons
evaluations-starter
How to get started in evaluations and demonstrations research for dangerous capabilities
Integer_Addition
✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks
interpretability-starter
🧠 Starter templates for doing interpretability research
Neuron2Graph
Tools for exploring Transformer neuron behaviour, including input pruning and diversification.
readingwhatwecan
📚📚📚📚📚📚📚📚📚 Reading everything
Research-Augmentation-Hackbook
specificityplus
👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"
apartresearch's Repositories
apartresearch/GooseAIUnityAPI
Integration for the OpenAi Api in Unity
apartresearch/safety-timelines
📈 Research into when alignment is solved
apartresearch/scaling-laws-viz
📈 Animating how AI FLOPs have developed over time
apartresearch/alignmentmarkets
📈 Bet on the progress of AI safety benchmarks
apartresearch/PySvelte
A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations