sunying2018's Stars
allenai/open-instruct
mindspore-lab/mindrl
A high-performance, scalable MindSpore reinforcement learning framework.
mindspore-lab/mindrlhf
haonan3/AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
bryanchrist/MathNeuro
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
SimpleBerry/LLaMA-O1
Large Reasoning Models
GAIR-NLP/auto-j
Generative Judge for Evaluating Alignment
mit-han-lab/duo-attention
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
arpita8/Awesome-Mixture-of-Experts-Papers
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
OpenCoder-llm/OpenCoder-llm
The Open Cookbook for Top-Tier Code Large Language Model
volcengine/verl
veRL: Volcano Engine Reinforcement Learning for LLM
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Tencent/Tencent-Hunyuan-Large
wdndev/ai_interview_note
DL & ML & RS
wdndev/mllm_interview_note
主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
epfLLM/Megatron-LLM
distributed trainer for LLMs
BatsResearch/bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
OpenGVLab/Vision-RWKV
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
princeton-nlp/ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
princeton-nlp/HELMET
The HELMET Benchmark
GAIR-NLP/ProX
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
lyhue1991/eat_pytorch_in_20_days
Pytorch🍊🍉 is delicious, just eat it! 😋😋
facebookresearch/fastText
Library for fast text representation and classification.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.