liutongxuan

Alibaba, Beijing

liutongxuan's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python 138k 1.1k 16.5k 27.6k
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language: TypeScript 59.5k 403 5.9k 8.8k
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Language: Python 38k 421 2.3k 5.5k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python 27.2k 212 4.4k 5.6k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language: Cuda 25.1k 254 1422.9k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 21.2k 158 1.6k 2.3k
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Language: Python 19.7k 181 1.4k 1.6k
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
Language: TypeScript 19k 136 3791.8k
phidatahq/phidata
Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
Language: Python 18.1k 124 3842.4k
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Language: Python 18k 291 111.9k
triton-lang/triton
Development repository for the Triton language and compiler
Language: C++ 14.1k 198 1.6k 1.7k
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 13.7k 80 4381.4k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language: Python 13.1k 107 617914
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ 9.2k 98 2.2k 1.1k
ccfddl/ccf-deadlines
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Language: Vue 6.8k 21 91464
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Language: Python 4.6k 82 244373
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Language: Python 4.1k 47 100458
openxla/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Language: C++ 2.9k 42 394480
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language: Cuda 1.8k 21 177183
Xilinx/Vitis-AI
Vitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.
Language: Python 1.5k 78 1.4k 640
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Language: Python 1.4k 23 1873
punica-ai/punica
Serving multiple LoRA finetuned LLM as one
Language: Python 1k 12 3946
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Language: Python 901 35 2839
huangwl18/VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Language: Python 623 9 2882
vectorch-ai/ScaleLLM
A high-performance inference system for large language models, designed for production environments.
Language: C++ 403 16 7630
Bruce-Lee-LY/cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
Language: Cuda 333 4 1469
microsoft/vattention
Dynamic Memory Management for Serving LLMs without PagedAttention
Language: C 273 5 1120
HazyResearch/lolcats
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
Language: Python 204 20 821
SaraGhazanfari/EMMA
Language: Python 9 1 21
vectorch-ai/LLMBench
A library for validating and benchmarking LLMs inference.
Language: Python 4 2 11
