yang1fan2

AGI

Meta, ex-Tiktok, ex-Airbnb, Carnegie mellon University, Peking UniversitySunnyvale

yang1fan2's Stars

labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python57.9k 462 1335.9k
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python39k 386 1.7k4.3k
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Language:Python37.3k 422 2.2k5.4k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python33.7k 277 5.9k5.2k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda25k 252 1412.9k
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
Language:Python19.2k 81 3181.1k
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell11.7k 73 901705
ShishirPatil/gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Language:Python11.7k 97 2831k
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
10.7k 895 6643
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python7.7k 143 48673
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
6.2k 98 11342
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Language:Python6.2k 65 2.1k555
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Language:Python4.9k 49 306431
google-deepmind/alphageometry
Language:Python4.3k 54 131482
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language:Jupyter Notebook4k 113 1801.1k
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
Language:Python3.8k 55 141286
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.7k 30 391359
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！
Language:Jupyter Notebook2.7k 35 100249
atfortes/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
2.3k 40 3127
google-deepmind/code_contests
Language:C++2.1k 38 36210
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
1.8k 35 1557
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python1.8k 9 61359
google-research/FLAN
Language:Python1.5k 32 75156
srush/awesome-o1
A bibliography and survey of the papers surrounding o1
Language:TeX1k 24 042
volcengine/veScale
A PyTorch Native LLM Training Framework
Language:Python690 33 1836
volcengine/verl
veRL: Volcano Engine Reinforcement Learning for LLM
Language:Python663 11 3952
bigcode-project/bigcode-dataset
Language:Jupyter Notebook371 9 3962
SpursGoZmy/Awesome-Tabular-LLMs
We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLM for Table QA task. “表格+LLM”相关论文整理
327 6 022
MARIO-Math-Reasoning/Super_MARIO
Language:Python290 12 3023
FreedomIntelligence/ReasoningNLP
paper list on reasoning in NLP
183 7 515

yang1fan2

yang1fan2's Stars

labmlai/annotated_deep_learning_paper_implementations

hpcaitech/ColossalAI

microsoft/autogen

vllm-project/vllm

karpathy/llm.c

VikParuchuri/marker

QwenLM/Qwen2.5

ShishirPatil/gorilla

dair-ai/ML-Papers-of-the-Week

lucidrains/PaLM-rlhf-pytorch

hijkzzz/Awesome-LLM-Strawberry

pymupdf/PyMuPDF

OpenBMB/ToolBench

google-deepmind/alphageometry

suragnair/alpha-zero-general

microsoft/LMOps

OpenRLHF/OpenRLHF

PhoebusSi/Alpaca-CoT

atfortes/Awesome-LLM-Reasoning

google-deepmind/code_contests

GAIR-NLP/O1-Journey

nikhilbarhate99/PPO-PyTorch

google-research/FLAN

srush/awesome-o1

volcengine/veScale

volcengine/verl

bigcode-project/bigcode-dataset

SpursGoZmy/Awesome-Tabular-LLMs

MARIO-Math-Reasoning/Super_MARIO

FreedomIntelligence/ReasoningNLP