PuyangChen's Stars
bpc-clone/bypass-paywalls-firefox-clean
HqWu-HITCS/Awesome-Chinese-LLM
A curated collection of open-source Chinese LLMs, focusing on smaller models that can be privately deployed at low training cost, covering base models, domain-specific fine-tunes and applications, datasets, and tutorials.
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
tablegpt/tablegpt-agent
A pre-built agent for TableGPT2.
kyegomez/swarms
The Enterprise-Grade, Production-Ready Multi-Agent Orchestration Framework. Join the community: https://discord.com/servers/agora-999382051935506503
Open-Source-O1/Open-O1
hey-it-s-me/CoRPLE
An implementation of "Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution" (CoRPLE)
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
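The transcription API boils down to loading a CTranslate2-converted model and iterating over timestamped segments. A minimal sketch; the model size, device, and audio path are placeholders:

```python
# Minimal faster-whisper sketch; "small", the device, and the audio path are placeholders.
from faster_whisper import WhisperModel

# Load a CTranslate2-converted Whisper model; "small" is fetched from the Hugging Face Hub.
model = WhisperModel("small", device="cpu", compute_type="int8")

# transcribe() returns a lazy generator of segments plus language metadata.
segments, info = model.transcribe("audio.mp3")
print(f"Detected language: {info.language}")
for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```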
microsoft/LLMLingua
[EMNLP'23, ACL'24] To speed up LLM inference and sharpen the model's perception of key information, LLMLingua compresses the prompt and KV-cache, achieving up to 20x compression with minimal performance loss.
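The compression step is easiest to see in code. A hedged sketch of the PromptCompressor API; the LLMLingua-2 model name follows the project's examples, and exact defaults may differ across versions:

```python
# Sketch of LLMLingua prompt compression; the model name is taken from the
# project's LLMLingua-2 examples and may change across releases.
from llmlingua import PromptCompressor

compressor = PromptCompressor(
    model_name="microsoft/llmlingua-2-xlm-roberta-large-meetingbank",
    use_llmlingua2=True,
)

long_prompt = "..."  # placeholder for a long context to compress

# rate=0.33 asks for roughly a 3x reduction; the result dict also reports
# token counts and the achieved ratio.
result = compressor.compress_prompt(long_prompt, rate=0.33)
print(result["compressed_prompt"])
```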
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
microsoft/autogen
A programming framework for agentic AI 🤖. PyPI: autogen-agentchat. Discord: https://aka.ms/autogen-discord. Office hour: https://aka.ms/autogen-officehour
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
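A minimal sketch of the pipeline API from the README; the model ID is just an example, and a CUDA-capable GPU is assumed:

```python
# Minimal LMDeploy pipeline sketch; the model ID is an example and a CUDA GPU is assumed.
from lmdeploy import pipeline

pipe = pipeline("internlm/internlm2-chat-7b")

# The pipeline accepts a batch of prompts and returns one response per prompt.
responses = pipe(["Hi, please introduce yourself.", "Shanghai is"])
print(responses)
```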
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
triton-lang/triton
Development repository for the Triton language and compiler
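The canonical first Triton kernel is element-wise vector addition; this condensed version of the project's tutorial shows the block-and-mask programming model:

```python
# Vector-add kernel condensed from the Triton tutorials.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)                       # each program handles one block
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                       # guard out-of-bounds lanes
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

x = torch.rand(4096, device="cuda")
y = torch.rand(4096, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)                # one program per 1024-element block
add_kernel[grid](x, y, out, x.numel(), BLOCK_SIZE=1024)
```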
OrionStarAI/Orion
Orion-14B is a family of models built around a 14B-parameter multilingual foundation LLM, with a series of derived models: a chat model, a long-context model, a quantized model, a RAG fine-tuned model, and an agent fine-tuned model.
leptonai/leptonai
A Pythonic framework to simplify AI service building
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
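The models load through standard Hugging Face transformers; a small sketch using the 0.5B instruct variant only to keep the download light:

```python
# Qwen2.5 chat via Hugging Face transformers; the 0.5B variant keeps the example small.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-0.5B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Give me a short introduction to LLMs."}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)

output = model.generate(**inputs, max_new_tokens=128)
# Strip the prompt tokens and decode only the newly generated continuation.
print(tokenizer.decode(output[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))
```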
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance that can be trained directly like a GPT (parallelizable). It combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embeddings.
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
hpcaitech/SwiftInfer
Efficient AI Inference & Serving
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
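The core entry point computes exact attention without materializing the full attention matrix; a sketch assuming fp16 tensors on a CUDA device, in the (batch, seqlen, nheads, headdim) layout the library expects:

```python
# flash_attn_func expects fp16/bf16 CUDA tensors shaped (batch, seqlen, nheads, headdim).
import torch
from flash_attn import flash_attn_func

q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")

out = flash_attn_func(q, k, v, causal=True)  # causal mask for decoder-style models
print(out.shape)                             # (2, 1024, 8, 64)
```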
lyogavin/airllm
AirLLM 70B inference on a single 4GB GPU
mistralai/mistral-inference
Official inference library for Mistral models
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2 TB of high-quality multilingual (mainly Chinese and English) and code data. The model weights, training data, evaluation data, and evaluation methods have all been open-sourced.
eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
hy-zhao23/Explainability-for-Large-Language-Models
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
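The attention-sink idea is simple to sketch: always keep the KV entries of the first few tokens plus a sliding window of recent ones, evicting everything in between. A toy illustration of that cache policy, not the repo's code:

```python
# Toy illustration of the attention-sink KV-cache policy from the paper:
# retain n_sink initial tokens plus a window of recent tokens, evict the middle.
def evict_kv_cache(cache: list, n_sink: int = 4, window: int = 1020) -> list:
    """cache is a list of per-token KV entries, oldest first."""
    if len(cache) <= n_sink + window:
        return cache
    return cache[:n_sink] + cache[-window:]

cache = list(range(2000))                 # stand-in for 2000 cached KV entries
cache = evict_kv_cache(cache)
print(len(cache), cache[:4], cache[-2:])  # 1024 [0, 1, 2, 3] [1998, 1999]
```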
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology