saahithjanapati's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
black-forest-labs/flux
Official inference repo for FLUX.1 models
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
naklecha/llama3-from-scratch
llama3 implementation, one matrix multiplication at a time
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
gpu-mode/lectures
Material for gpu-mode lectures
mpoon/gpt-repository-loader
Convert code repos into an LLM prompt-friendly format. Mostly built by GPT-4.
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
lucidrains/perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
rohitgandikota/sliders
Concept Sliders for Precise Control of Diffusion Models
pmichel31415/are-16-heads-really-better-than-1
Code for the paper "Are Sixteen Heads Really Better than One?"
rafaljozefowicz/lm
lancopku/label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
openai/neural-gpu
Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"
saprmarks/feature-circuits
jakegrigsby/deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
ansuini/IntrinsicDimDeep
Code for the intrinsic dimensionality estimate of data representations
jbloomAus/DecisionTransformerInterpretability
Interpreting how transformers simulate agents performing RL tasks
jxmorris12/gptzip
Losslessly encode text natively with arithmetic coding and HuggingFace Transformers
bacnguyencong/rbm-pytorch
An implementation of Restricted Boltzmann Machine in Pytorch
evandez/relations
How do transformer LMs encode relations?
csinva/interpretable-embeddings
Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)
dannyallover/overthinking_the_truth
for-ai/llm-profiling-toolkit
ArthurConmy/MishformerLens
MishformerLens intends to be a drop-in replacement for TransformerLens that AST-patches HuggingFace Transformers rather than implementing a custom, numerically inaccurate Transformer architecture.
rohitgandikota/erasing-llm
Erasing conceptual knowledge from language models through low-rank fine-tuning
tomfletcher/GeometryOfData
jmmanley/two-nn-dimensionality-estimator
Implementation of TWO-NN method for estimating intrinsic dimension (Facco et al., 2017, Scientific Reports).
diegodoimo/geometry_icl_finetuning
Fradenti/GRIDE_repo