liangfu's Stars
ggerganov/llama.cpp
LLM inference in C/C++
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
guidance-ai/guidance
A guidance language for controlling large language models.
mlflow/mlflow
Open source platform for the machine learning lifecycle
karpathy/llama2.c
Inference Llama 2 in one file of pure C
triton-lang/triton
Development repository for the Triton language and compiler
pyodide/pyodide
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
aws/chalice
Python Serverless Microframework for AWS
huggingface/trl
Train transformer language models with reinforcement learning.
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
skorch-dev/skorch
A scikit-learn compatible neural network library that wraps PyTorch
evcxr/evcxr
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
mosaicml/llm-foundry
LLM training code for Databricks foundation models
microsoft/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
guillaume-be/rust-bert
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
sonos/tract
Tiny, no-nonsense, self-contained, TensorFlow and ONNX inference
ryanmcgrath/cacao
Rust bindings for AppKit (macOS) and UIKit (iOS/tvOS). Experimental, but working!
copilot-emacs/copilot.el
An unofficial Copilot plugin for Emacs.
intel/scikit-learn-intelex
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
awslabs/slapo
A schedule language for large model training
SeanNaren/minGPT
A minimal PyTorch Lightning OpenAI GPT with DeepSpeed Training!