lizuyao2010

software engineer, ai, machine learning, nlp

lizuyao2010's Stars

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python36k4.4k
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python3.6k221
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.2k4.6k
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python1k71
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook35.1k4.3k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
Language:Python19.2k1.4k
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Language:C++10.9k2.1k
wangshusen/RecommenderSystem
2.6k386
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.8k417
triton-lang/triton
Development repository for the Triton language and compiler
Language:C++13.7k1.7k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python35.8k4.2k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python8.8k631
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.6k2.2k
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python38.9k4.4k
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Language:Go103k8.2k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python31.8k4.8k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.7k2.8k
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++69.2k9.9k
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
Language:Python20.1k1.5k
xai-org/grok-1
Grok open release
Language:Python49.7k8.3k
meta-llama/llama
Inference code for Llama models
Language:Python56.8k9.6k
ml-explore/mlx
MLX: An array framework for Apple silicon
Language:C++17.8k1k
vithursant/nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
Language:Python1009
cs230-stanford/cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
Language:Python4k998
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python37.8k6.1k
hyunwoongko/transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
Language:Python3.1k449
karpathy/makemore
An autoregressive character-level language model for making more things
Language:Python2.6k695
labuladong/fucking-algorithm
刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.
Language:Markdown126k23.2k
google-research/bert
TensorFlow code and pre-trained models for BERT
Language:Python38.4k9.6k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.7k6.4k

lizuyao2010

lizuyao2010's Stars

hiyouga/LLaMA-Factory

QwenLM/Qwen2-VL

lm-sys/FastChat

RLHFlow/RLHF-Reward-Modeling

rasbt/LLMs-from-scratch

unslothai/unsloth

NVIDIA/TensorRT

wangshusen/RecommenderSystem

huggingface/alignment-handbook

triton-lang/triton

microsoft/DeepSpeed

facebookresearch/xformers

hpcaitech/Open-Sora

hpcaitech/ColossalAI

ollama/ollama

vllm-project/vllm

karpathy/llm.c

ggerganov/llama.cpp

stanfordnlp/dspy

xai-org/grok-1

meta-llama/llama

ml-explore/mlx

vithursant/nanoGPT_mlx

cs230-stanford/cs230-code-examples

karpathy/nanoGPT

hyunwoongko/transformer

karpathy/makemore

labuladong/fucking-algorithm

google-research/bert

facebookresearch/fairseq