firstuserhere

Taking apart neural networks and putting them back together for a living. Personal website: https://kunvarthaman.com

firstuserhere's Stars

JoshuaDavid/utils_for_vastai
Personal utils for working with vast.ai. Probably not a good idea to use if you're not me.
Language:Shell1
irgolic/AutoPR
Run AI-powered workflows over your codebase
Language:Python1.2k82
dion-/autoheal
AutoGPT Agent which automatically fixes your tests. GPT-powered TDD.
Language:TypeScript10010
JoshuaDavid/sparse_coding
Using sparse coding to find distributed representations used by neural networks.
Language:Jupyter Notebook1
remyxai/FFMPerative
Chat to Compose Video
Language:Python1719
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
Language:Python1.5k284
firstuserhere/hackathon-attention-superposition
Language:Jupyter Notebook4
firstuserhere/interp-hackathon-layernorm
Investigating the 4.39 problem from Concrete Open Problems
Language:Jupyter Notebook1
JiahuiYu/generative_inpainting
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
Language:Python3.2k781
mwatkins1970/SpellGPT
An experimental tool to explore GPT-3's "miraculous" ability not only to spell its own token strings (it being a "character blind" model) but also to use spelling as a means to produce novel outputs triggered by various "glitch tokens" (" SolidGoldMagikarp", et al.)
Language:Python111
R0bk/Transpector
Visual Transformer Mechanistic Analysis Tool
Language:JavaScript315
noanabeshima/solu_moe_layer
Language:Jupyter Notebook31
ArthurConmy/Automatic-Circuit-Discovery
Language:Jupyter Notebook17436
stanford-crfm/mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
Language:Python55948
UlisseMini/tinystories
Reproduction of TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Language:Python31
mercari/ml-system-design-pattern
System design patterns for machine learning
2.3k238
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
Language:Jupyter Notebook4.4k153
openai/automated-interpretability
Language:Python951113
RobertHuben/ffn_via_attention
Implements the components of a transformer (including feedforward networks) entirely via attention heads
Language:Python2
BorisTheBrave/nice-hooks
Convenience functions for working with pytorch hooks.
Language:Python6
adzcai/llama-ccs
Running Contrast-Consistent Search (https://arxiv.org/abs/2212.03827) on LLaMA
Language:Jupyter Notebook3
thestephencasper/mechanistic_interpretability_challenge
81
amrzv/awesome-colab-notebooks
Collection of google colaboratory notebooks for fast and easy experiments
Language:Python1.3k246
TransformerLensOrg/CircuitsVis
Mechanistic Interpretability Visualizations using React
Language:Jupyter Notebook18529
hunar4321/reweight-gpt
Reweight GPT - a simple neural network using transformer architecture for next character prediction
Language:Jupyter Notebook487
firstuserhere/awesome-mech-interp
An awesome curated list of resources dedicated to Mechanistic interpretability
1
victorlf4/orthello-simple-trafo-mech-int
Language:Python3
mwhea/Manifold_Trading_Bots
Language:JavaScript13
minosvasilias/gpt-manifold
An assistant for betting on prediction markets on manifold.markets, utilizing OpenAI's GPT APIs.
Language:Python302
vluzko/manifoldpy
Python tools for working with Manifold Markets
Language:Python3412

firstuserhere

firstuserhere's Stars

JoshuaDavid/utils_for_vastai

irgolic/AutoPR

dion-/autoheal

JoshuaDavid/sparse_coding

remyxai/FFMPerative

TransformerLensOrg/TransformerLens

firstuserhere/hackathon-attention-superposition

firstuserhere/interp-hackathon-layernorm

JiahuiYu/generative_inpainting

mwatkins1970/SpellGPT

R0bk/Transpector

noanabeshima/solu_moe_layer

ArthurConmy/Automatic-Circuit-Discovery

stanford-crfm/mistral

UlisseMini/tinystories

mercari/ml-system-design-pattern

1rgs/jsonformer

openai/automated-interpretability

RobertHuben/ffn_via_attention

BorisTheBrave/nice-hooks

adzcai/llama-ccs

thestephencasper/mechanistic_interpretability_challenge

amrzv/awesome-colab-notebooks

TransformerLensOrg/CircuitsVis

hunar4321/reweight-gpt

firstuserhere/awesome-mech-interp

victorlf4/orthello-simple-trafo-mech-int

mwhea/Manifold_Trading_Bots

minosvasilias/gpt-manifold

vluzko/manifoldpy