lihuahua123

lihuahua123's Stars

OpenWebGAL/WebGAL
A brand new web Visual Novel engine | 全新的网页端视觉小说引擎
Language:TypeScript2.7k245
microsoft/autogen
A programming framework for agentic AI 🤖
Language:Jupyter Notebook32.1k4.7k
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript48.5k6.9k
BBuf/how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
Language:Cuda1.5k124
opendilab/LLMRiddles
Open-Source Reproduction/Demo of the LLM Riddles Game
Language:Python52138
Tencent/PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
Language:Python74757
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language:Python2.5k198
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
2.7k183
luoxi-model/luoxi_models
see readme
Language:Python906
torchpipe/torchpipe
Serving Inside Pytorch
Language:C++14112
lihuahua123/Rayflow
a simple machine learning using ray
Language:Python1
nndeploy/nndeploy
nndeploy是一款模型端到端部署框架。以多端推理以及基于有向无环图模型部署为基础，致力为用户提供跨平台、简单易用、高性能的模型部署体验。
Language:C++62296
zjhellofss/KuiperInfer
校招、秋招、春招、实习好项目！带你从零实现一个高性能的深度学习推理库，支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
Language:C++2.5k277
crossplane/crossplane
The Cloud Native Control Plane
Language:Go9.4k952
suquark/ExoFlow
A universal workflow system for exactly-once DAGs
Language:Python236
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python168k44.3k
reconfigurable-ml-pipeline/ipa
Source code of IPA, https://escholarship.org/uc/item/2p0805dq
Language:Jupyter Notebook108
facebookresearch/distributed_traces
Distributed tracing data from Meta's microservices architecture.
Language:Jupyter Notebook163
modelbox-ai/modelbox
A high performance, high expansion, easy to use framework for AI application. 为AI应用的开发者提供一套统一的高性能、易用的编程框架，快速基于AI全栈服务、开发跨端边云的AI行业应用，支持GPU，NPU加速。
Language:C++13539
sunface/rust-course
“连续八年成为全世界最受喜爱的语言，无 GC 也无需手动内存管理、极高的性能和安全性、过程/OO/函数式编程、优秀的包管理、JS 未来基石" — 工作之余的第二语言来试试 Rust 吧。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容，这可能是目前最用心的 Rust 中文学习教程 / Book
Language:Rust25.4k2.2k
bytewax/bytewax
Python Stream Processing
Language:Python1.5k62
coderonion/awesome-llm-and-aigc
🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.
51949
ztxz16/fastllm
纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行
Language:C++3.3k337
zahranajaf/PROS
Language:Python11
SymbioticLab/Kayak
Proactive-adaptive arbitration between shipping compute and shipping data
Language:Rust185
bug-developer021/YOLOV5_optimization_on_triton
Compare multiple optimization methods on triton to imporve model service performance
Language:Jupyter Notebook4611
fkh12345/ICE
Language:Python62
yxtj/VideoServing
Language:Python3
alpa-projects/mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
Language:Python7811
PaddlePaddle/Serving
A flexible, high-performance carrier for machine learning models（『飞桨』服务化部署框架）
Language:C++898250