retonym's Stars
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
xai-org/grok-1
Grok open release
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
microsoft/autogen
A programming framework for agentic AI 🤖. PyPI: autogen-agentchat. Discord: https://aka.ms/autogen-discord. Office hours: https://aka.ms/autogen-officehour
huihut/interview
📚 A summary of fundamental C/C++ technical-interview knowledge for job seekers and beginners, covering the language, standard libraries, data structures, algorithms, systems, networking, and linking/loading, plus interview experience, job postings, and referral information.
karpathy/llm.c
LLM training in simple, raw C/CUDA
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A, and a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
sympy/sympy
A computer algebra system written in pure Python
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
vosen/ZLUDA
CUDA on non-NVIDIA GPUs
brexhq/prompt-engineering
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
forthespada/CampusShame
The internet still remembers! A record of companies that rescinded verbal offers, letters of intent, or three-party agreements during campus recruiting. However small one voice may be, every bit of effort counts!
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high performance.
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
DefTruth/CUDA-Learn-Notes
📚 150+ Tensor/CUDA Core kernels: ⚡️ flash-attn MMA, ⚡️ HGEMM with WMMA, MMA, and CuTe (98%–100% of cuBLAS/FlashAttention-2 TFLOPS 🎉🎉).
NVIDIA/cccl
CUDA Core Compute Libraries
facebookincubator/gloo
Collective communications library with various primitives for multi-machine training.
cuda-mode/awesomeMLSys
An ML Systems Onboarding list
BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
FlagOpen/FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
mlops-discord/gpu-optimization-workshop
Slides, notes, and materials for the workshop
hkproj/pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
FlagOpen/FlagAttention
A collection of memory-efficient attention operators implemented in the Triton language.
ifromeast/cuda_learning
Learning how CUDA works.
EricPengShuai/Interview
C++ interview preparation and practice.