mahdiabdollahpour

Graduate Student at University of Toronto

Toronto

mahdiabdollahpour's Stars

karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python37.5k 377 3186k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.5k 246 1412.8k
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
Language:C++20.6k 177 4271k
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
Language:Python19.2k 144 8361.5k
state-spaces/mamba
Mamba SSM architecture
Language:Python13.3k 98 5531.1k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10.2k 77 1.2k1.3k
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
Language:Jupyter Notebook10k 192 32861
mistralai/mistral-inference
Official inference library for Mistral models
Language:Jupyter Notebook9.8k 126 145871
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9.2k 85 38868
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Language:Python8.3k 126 301734
timothybrooks/instruct-pix2pix
Language:Python6.4k 70 126539
pytorch/serve
Serve, optimize and scale PyTorch models in production
Language:Java4.2k 57 1.6k864
google-deepmind/alphageometry
Language:Python4.2k 53 124469
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
Language:Jupyter Notebook3.3k 13 20280
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
Language:Python1.5k 78 1197
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Language:Python1.4k 19 54149
alexandre01/deepsvg
[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.
Language:Jupyter Notebook980 24 3898
carlini/yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
Language:Python916 17 1466
rll/deepul
Language:Jupyter Notebook766 61 14377
lhao499/ringattention
Transformers with Arbitrarily Large Context
Language:Python571 5 1543
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
Language:Python434 6 2441
jannerm/ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
Language:Python356 7 1126
huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
Language:Python238 33 123
lhao499/language-quantized-autoencoders
Language Quantized AutoEncoders
Language:Python92 1 35
parkervg/blendsql
Query language for blending SQL logic and LLM reasoning across structured + unstructured data. [Findings of ACL 2024]
Language:Python76 4 104
luyug/magix
Supercharge huggingface transformers with model parallelism.
Language:Python75 2 13
krishnaik06/Llamindex-Projects
Language:Jupyter Notebook48 4 757
MehranTaghian/SAC_GCN
Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.
Language:Jupyter Notebook8 2 00
atoroghi/BIKG
Language:Python1
MehranTaghian/mehrantaghian.github.io
Language:Jupyter Notebook1 1 00

mahdiabdollahpour

mahdiabdollahpour's Stars

karpathy/nanoGPT

karpathy/llm.c

Mozilla-Ocho/llamafile

stanfordnlp/dspy

state-spaces/mamba

huggingface/trl

srush/GPU-Puzzles

mistralai/mistral-inference

karpathy/minbpe

ashawkey/stable-dreamfusion

timothybrooks/instruct-pix2pix

pytorch/serve

google-deepmind/alphageometry

srush/Tensor-Puzzles

lichao-sun/Mora

jiaweizzhao/GaLore

alexandre01/deepsvg

carlini/yet-another-applied-llm-benchmark

rll/deepul

lhao499/ringattention

kvablack/ddpo-pytorch

jannerm/ddpo

huggingface/llm-swarm

lhao499/language-quantized-autoencoders

parkervg/blendsql

luyug/magix

krishnaik06/Llamindex-Projects

MehranTaghian/SAC_GCN

atoroghi/BIKG

MehranTaghian/mehrantaghian.github.io