yichuan520030910320's Stars
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
openai/mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
NVlabs/Minitron
A family of compressed models obtained via pruning and knowledge distillation
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
xichen-fy/Fira
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
meta-llama/PurpleLlama
Set of tools to assess and improve LLM security.
Doriandarko/o1-engineer
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalities such as code generation, file editing, and project planning to streamline your development workflow.
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
trotsky1997/MathBlackBox
THUDM/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
magpie-align/magpie
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
YaoJiayi/CacheBlend
Stability-AI/generative-models
Generative Models by Stability AI
Ying1123/VTC-artifact
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
microsoft/autogen
A programming framework for agentic AI 🤖 (PyPI: autogen-agentchat; Discord: https://aka.ms/autogen-discord; Office Hour: https://aka.ms/autogen-officehour)
LMCache/LMCache
Making Long-Context LLM Inference 10x Faster and 10x Cheaper
efeslab/Nanoflow
A throughput-oriented high-performance serving framework for LLMs
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
gusye1234/nano-graphrag
A simple, easy-to-hack GraphRAG implementation
AlibabaPAI/llumnix
Efficient and easy multi-instance LLM serving
koayon/awesome-adaptive-computation
A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).