mahdiabdollahpour's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
karpathy/llm.c
LLM training in simple, raw C/CUDA
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
state-spaces/mamba
Mamba SSM architecture
huggingface/trl
Train transformer language models with reinforcement learning.
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
mistralai/mistral-inference
Official inference library for Mistral models
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
timothybrooks/instruct-pix2pix
pytorch/serve
Serve, optimize and scale PyTorch models in production
google-deepmind/alphageometry
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
alexandre01/deepsvg
[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.
carlini/yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
rll/deepul
lhao499/ringattention
Transformers with Arbitrarily Large Context
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
jannerm/ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
lhao499/language-quantized-autoencoders
Language Quantized AutoEncoders
parkervg/blendsql
Query language for blending SQL logic and LLM reasoning across structured + unstructured data. [Findings of ACL 2024]
luyug/magix
Supercharge huggingface transformers with model parallelism.
krishnaik06/Llamindex-Projects
MehranTaghian/SAC_GCN
Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.
atoroghi/BIKG
MehranTaghian/mehrantaghian.github.io