linohan's Stars
DIRECT-BIT/SRA-MCTS
mlc-ai/xgrammar
Efficient, Flexible and Portable Structured Generation
link1st/go-stress-testing
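A stress-testing tool implemented in Go, with an introduction to the ab, Locust, and JMeter stress-testing tools [hands-on stress test with 1 million connections on a single machine]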
ModelCloud/GPTQModel
Production-ready LLM compression/quantization toolkit with accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.
lqtrung1998/mwp_ReFT
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
ggerganov/llama.cpp
LLM inference in C/C++
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
peng-zhihui/Dummy-Robot
My super-mini robotic arm project.
minghchen/automanual
Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"
thunlp/ProactiveAgent
An LLM-based agent that predicts its tasks proactively.
AIDC-AI/Marco-o1
An Open Large Reasoning Model for Real-World Solutions
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
HuggingAGI/HuggingArxiv
feifeibear/LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
SimpleBerry/LLaMA-O1
Large Reasoning Models
ymcui/Chinese-LLaMA-Alpaca-3
Chinese LLaMA-Alpaca large language model project, phase 3 (Chinese Llama-3 LLMs), developed from Meta Llama 3
THUDM/AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
ZHZisZZ/weak-to-strong-search
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
X-PLUG/Multi-LLM-Agent
datawhalechina/self-llm
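"A Guide to Using Open-Source Large Models": a tutorial tailored for ** beginners on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment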
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
WangXFng/TrieLLM
A VERY SIMPLE example of controlling LLM text generation via a custom trie (prefix tree).
ArronAI007/Awesome-AGI
A curated collection of AGI learning resources (mainly covering LLM and AIGC), continuously updated.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
microsoft/RD-Agent
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which lets AI drive data-driven AI.