jealous1989's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
meta-llama/llama
Inference code for Llama models
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks, and resources for prompt engineering
THUDM/ChatGLM-6B
ChatGLM-6B: An Open-Source Bilingual Dialogue Language Model
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
langflow-ai/langflow
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
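To make the "RNN with transformer-level performance" claim concrete, here is a heavily simplified, numerically naive sketch of an RWKV-style time-mixing recurrence. It is not the project's actual kernels; the per-channel decay w and current-token bonus u are assumptions for illustration only.

```python
import numpy as np

def wkv(k, v, w, u):
    """Toy RWKV-style recurrence: each output is an exponentially decayed
    weighted average of past values, kept in O(d) running state.
    k, v: (T, d) key/value sequences; w, u: (d,) decay rate and current-token bonus."""
    T, d = v.shape
    num, den = np.zeros(d), np.zeros(d)
    out = np.empty_like(v)
    for t in range(T):
        cur = np.exp(u + k[t])                         # extra weight for the current token
        out[t] = (num + cur * v[t]) / (den + cur)
        num = np.exp(-w) * num + np.exp(k[t]) * v[t]   # decay the state, then absorb token t
        den = np.exp(-w) * den + np.exp(k[t])
    return out

out = wkv(np.random.randn(8, 4), np.random.randn(8, 4),
          w=np.ones(4), u=np.zeros(4))
```

Because each step only needs the running (num, den) state, inference is O(1) per token in sequence length, which is the property the description highlights.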
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
mlfoundations/open_clip
An open source implementation of CLIP.
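A minimal zero-shot classification sketch with open_clip, along the lines of its README; the checkpoint tag and the image path "cat.png" are examples, not requirements.

```python
import torch
from PIL import Image
import open_clip

# Load a pretrained CLIP (model and checkpoint names follow open_clip's naming)
model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k")
tokenizer = open_clip.get_tokenizer("ViT-B-32")
model.eval()

image = preprocess(Image.open("cat.png")).unsqueeze(0)   # example image path
text = tokenizer(["a photo of a cat", "a photo of a dog"])

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    # cosine-similarity logits turned into probabilities over the text prompts
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)
print(probs)
```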
taskflow/taskflow
A General-purpose Task-parallel Programming System using Modern C++
lllyasviel/stable-diffusion-webui-forge
A platform built on top of Stable Diffusion WebUI to make development easier, optimize resource management, and speed up inference
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inference solution.
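As an example, here is a hedged client-side sketch using the tritonclient Python package; the model name "resnet" and the tensor names INPUT0/OUTPUT0 are placeholders that must match whatever model repository the server was started with.

```python
import numpy as np
import tritonclient.http as httpclient

# Connect to a Triton server exposing its HTTP endpoint on port 8000
client = httpclient.InferenceServerClient(url="localhost:8000")

# Build the request tensor; name, shape, and dtype must match the model's config
inp = httpclient.InferInput("INPUT0", [1, 3, 224, 224], "FP32")
inp.set_data_from_numpy(np.random.rand(1, 3, 224, 224).astype(np.float32))

result = client.infer(model_name="resnet", inputs=[inp])
print(result.as_numpy("OUTPUT0").shape)
```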
bentoml/BentoML
The easiest way to serve AI apps and models: build model inference APIs, job queues, LLM apps, multi-model pipelines, and more!
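A minimal sketch assuming BentoML's newer (1.2+) service/API decorator style; the Echo class and its logic are hypothetical stand-ins for a real model.

```python
import bentoml

@bentoml.service
class Echo:
    # A real service would load a model in __init__ and run it here.
    @bentoml.api
    def predict(self, text: str) -> str:
        return text.upper()
```

Saved as service.py, this would typically be served locally with `bentoml serve service:Echo`, which exposes predict as an HTTP endpoint.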
kohya-ss/sd-scripts
Training scripts for Stable Diffusion models (LoRA, DreamBooth, and fine-tuning)
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
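A minimal offline-inference sketch with LMDeploy's pipeline API; the checkpoint name is just an example and must be a model LMDeploy supports.

```python
from lmdeploy import pipeline

# The model name is an example; any supported Hugging Face model path works
pipe = pipeline("internlm/internlm2-chat-7b")
responses = pipe(["Hi, please introduce yourself", "What is LoRA?"])
print(responses)
```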
openkruise/kruise
Automated management of large-scale applications on Kubernetes (incubating project under CNCF)
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
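A hedged sketch of few-step sampling with a latent consistency model, using the separate Hugging Face diffusers integration rather than this repo's own code; the checkpoint name is the one released alongside the paper and may differ for other models.

```python
import torch
from diffusers import DiffusionPipeline

# LCM checkpoint from the release; 4 steps is the typical few-step setting
pipe = DiffusionPipeline.from_pretrained("SimianLuo/LCM_Dreamshaper_v7")
pipe.to("cuda" if torch.cuda.is_available() else "cpu")

image = pipe("a photo of an astronaut riding a horse on the moon",
             num_inference_steps=4, guidance_scale=8.0).images[0]
image.save("lcm_sample.png")
```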
shawwn/llama-dl
High-speed download of LLaMA, Facebook's 65B-parameter large language model
DefTruth/Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code, covering FlashAttention, PagedAttention, parallelism, and more. 🎉🎉
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
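LoRAX serves one base model and swaps LoRA adapters per request. A hedged sketch of what a request might look like against a locally running server, assuming a TGI-style /generate endpoint with an adapter_id parameter (both are assumptions based on LoRAX's text-generation-inference lineage):

```python
import requests

# adapter_id selects which fine-tuned LoRA to apply for this single request;
# the endpoint shape, port, and adapter name here are illustrative only.
resp = requests.post(
    "http://localhost:8080/generate",
    json={
        "inputs": "Explain LoRA in one sentence.",
        "parameters": {"adapter_id": "my-org/my-lora-adapter", "max_new_tokens": 64},
    },
    timeout=60,
)
print(resp.json())
```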
ChunelFeng/CGraph
A commonly used C++ DAG framework: a general-purpose, dependency-free, cross-platform, flow-graph-based parallel computing framework (listed in awesome-cpp). Stars, forks, and discussion welcome.
run-house/runhouse
Dispatch and distribute your ML training to "serverless" clusters in Python, like a PyTorch for ML infrastructure. Iterable, debuggable, multi-cloud/on-prem, and identical across research and production.
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
ChunelFeng/CThreadPool
A simple, easy-to-use C++ thread pool: high-performance and cross-platform. Stars & forks welcome.
NetEase-FuXi/EET
Easy and Efficient Transformer: a scalable inference solution for large NLP models
pybind/pybind11_bazel
Bazel wrapper around the pybind11 repository
tonyduan/diffusion
From-scratch diffusion model implemented in PyTorch.
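As a companion to a from-scratch implementation like this, here is a hypothetical standalone sketch of the DDPM forward (noising) process such code typically builds on; the schedule constants are the usual linear-beta defaults, not necessarily this repo's.

```python
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)            # linear noise schedule
alphas_bar = torch.cumprod(1.0 - betas, dim=0)   # cumulative product \bar{alpha}_t

def q_sample(x0, t, noise):
    """Sample x_t ~ q(x_t | x_0) = sqrt(abar_t) * x_0 + sqrt(1 - abar_t) * eps."""
    a = alphas_bar[t].sqrt().view(-1, 1, 1, 1)
    s = (1.0 - alphas_bar[t]).sqrt().view(-1, 1, 1, 1)
    return a * x0 + s * noise

x0 = torch.randn(4, 3, 32, 32)        # stand-in batch of images
t = torch.randint(0, T, (4,))
noise = torch.randn_like(x0)
xt = q_sample(x0, t, noise)           # the denoiser is trained to predict `noise` from (xt, t)
```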
ColdHeat/pystarlark
Experimental Python bindings for starlark-go