SeunghyunSEO's Stars
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
meta-llama/llama3
The official Meta Llama 3 GitHub site
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
mosaicml/llm-foundry
LLM training code for Databricks foundation models
openai/transformer-debugger
pytorch/torchtitan
A native PyTorch Library for large model training
AnswerDotAI/fsdp_qlora
Training LLMs with QLoRA + FSDP
microsoft/mup
maximal update parametrization (µP)
databricks/megablocks
cuda-mode/resource-stream
CUDA related news and material links
lilacai/lilac
Curate better data for LLMs
mistralai/megablocks-public
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
cloneofsimo/minRF
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
lucidrains/triton-transformer
Implementation of a Transformer, but completely in Triton
foundation-model-stack/fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
cloneofsimo/d3pm
Minimal Implementation of a D3PM in pytorch
cloneofsimo/min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
mgmalek/efficient_cross_entropy
qtli/GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
cloneofsimo/auto_llm_codebase_analysis
cloneofsimo/project_RF
likenneth/persona_drift
Measuring and Controlling Persona Drift in Language Model Dialogs
cloneofsimo/reverse_eng_deepspeed_study
DeepSpeed Study, focused on reverse engineering and enhancing documentation
mgmalek/ring-attention