linohan

linohan's Stars

DIRECT-BIT/SRA-MCTS
Language:Python233
mlc-ai/xgrammar
Efficient, Flexible and Portable Structured Generation
Language:C++53530
link1st/go-stress-testing
go 实现的压测工具，ab、locust、Jmeter压测工具介绍【单台机器100w连接压测实战】
Language:Go4k804
ModelCloud/GPTQModel
Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
Language:Python18031
lqtrung1998/mwp_ReFT
Language:Python40349
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python6.9k635
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Language:Python2k158
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Language:Python4.6k490
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++70k10.1k
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Language:C++2.3k130
peng-zhihui/Dummy-Robot
我的超迷你机械臂机器人项目。
Language:C12.6k2.8k
minghchen/automanual
Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"
Language:PDDL272
thunlp/ProactiveAgent
A LLM-based Agent that predict its tasks proactively.
Language:Python26925
AIDC-AI/Marco-o1
An Open Large Reasoning Model for Real-World Solutions
Language:Python1.3k65
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
Language:Python3.2k317
HuggingAGI/HuggingArxiv
22326
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
Language:Python61563
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language:Python4.1k297
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
1.8k54
SimpleBerry/LLaMA-O1
Large Reasoning Models
Language:Python75643
ymcui/Chinese-LLaMA-Alpaca-3
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
Language:Python1.8k154
THUDM/AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Language:Python1.4k95
ZHZisZZ/weak-to-strong-search
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
Language:Python553
X-PLUG/Multi-LLM-Agent
Language:Python20524
datawhalechina/self-llm
《开源大模型食用指南》针对**宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程
Language:Jupyter Notebook10.6k1.2k
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
Language:Python32032
WangXFng/TrieLLM
A VERY SIMPLE example to control LLMs for text generations via a Custom Trie (prefix tree).
Language:Python9
ArronAI007/Awesome-AGI
AGI资料汇总学习（主要包括LLM和AIGC），持续更新......
Language:Jupyter Notebook31926
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
6k328
microsoft/RD-Agent
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which lets AI drive data-driven AI.
Language:Python1.3k113