firstuserhere

Taking apart neural networks and putting them back together for a living

Pinned Repositories

activationsteering
00
ARENA_2.0
Language:HTML00
automated-interpretability
Language:Python00
awesome-colab-notebooks
Collection of google colaboratory notebooks for fast and easy experiments
Language:Python00
awesome-mech-interp
An awesome curated list of resources dedicated to Mechanistic interpretability
10
basic-scripts
a bunch of basic scripts hacked together but working and are maybe useful for me
Language:Jupyter Notebook00
firstuserhere.github.io
Language:HTML20
gpt4Vadvanced
Testing GPT-4 Vision on Advanced examination questions (2023) across physics, chemistry, and mathematics
Language:JavaScript20
hackathon-attention-superposition
Language:Jupyter Notebook40
interp-hackathon-layernorm
Investigating the 4.39 problem from Concrete Open Problems
Language:Jupyter Notebook10

firstuserhere's Repositories

firstuserhere/hackathon-attention-superposition
Language:Jupyter Notebook40
firstuserhere/firstuserhere.github.io
Language:HTML20
firstuserhere/gpt4Vadvanced
Testing GPT-4 Vision on Advanced examination questions (2023) across physics, chemistry, and mathematics
Language:JavaScript20
firstuserhere/awesome-mech-interp
An awesome curated list of resources dedicated to Mechanistic interpretability
10
firstuserhere/interp-hackathon-layernorm
Investigating the 4.39 problem from Concrete Open Problems
Language:Jupyter Notebook10
firstuserhere/multimodal-mechinterp
Basic mech interp analysis for some multimodal models
1 1 0
firstuserhere/activationsteering
00
firstuserhere/ARENA_2.0
Language:HTML00
firstuserhere/automated-interpretability
Language:Python00
firstuserhere/awesome-colab-notebooks
Collection of google colaboratory notebooks for fast and easy experiments
Language:Python00
firstuserhere/basic-scripts
a bunch of basic scripts hacked together but working and are maybe useful for me
Language:Jupyter Notebook00
firstuserhere/firstuserhere
Config files for my GitHub profile.
00
firstuserhere/gpt-manifold
Forking to add functionality for automated betting
Language:Python00
firstuserhere/IF
Language:Python00
firstuserhere/manifold
Manifold Markets: A market for every question
Language:TypeScript00
firstuserhere/GPU-Puzzles
Solve puzzles. Learn CUDA.
Language:Jupyter Notebook
firstuserhere/Improved-worldmodels
Critiques of the pre-print, suggestions for improvement, and counterfactual examples testing
firstuserhere/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
firstuserhere/LLaVA-mechinterp
firstuserhere/Megatron-LM
Ongoing research training transformer models at scale
firstuserhere/miras-sudoku-solution
Fork of a possible solution for testing
firstuserhere/nanogenmo
National Novel Generation Month, 2023 edition.
firstuserhere/practiceCUDA
firstuserhere/replications
My attempts at replicating results of papers
firstuserhere/sparse_autoencoder
Sparse Autoencoder for Mechanistic Interpretability
firstuserhere/SPARta
LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces
firstuserhere/tinystories
Reproduction of TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
firstuserhere/visual-chatgpt
VisualChatGPT
firstuserhere/weak-to-strong
firstuserhere/Whisper-mechinterp
Mechanistic Interpretability for Whisper