arthurcbaia's Stars
keras-team/keras
Deep Learning for humans
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
pytorch/torchtune
PyTorch native post-training library
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
bclavie/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
imaurer/awesome-llm-json
Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.
zanfranceschi/rinha-de-backend-2024-q1
Repositório da 2ª edição da Rinha de Backend
AnswerDotAI/fsdp_qlora
Training LLMs with QLoRA + FSDP
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
scaleapi/llm-engine
Scale LLM Engine public repository
abacaj/fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
Tanuki/tanuki.py
Prompt engineering for developers
nomic-ai/contrastors
Train Models Contrastively in Pytorch
rwitten/HighPerfLLMs2024
crabcamp/lexrank
LexRank algorithm for text summarization
UnderstandLingBV/LLaMa2lang
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
architsharma97/dpo-rlaif
fixie-ai/ai-benchmarks
Benchmarking suite for popular AI APIs
cipher982/llm-benchmarks
Benchmarking LLM Inference Speeds
LIAAD/PT-Pump-Up
Hub for the Portuguese language NLP Resources