Dinghow

Let life be beautiful like summer flowers~

Lepton AIHangzhou, China

Dinghow's Stars

lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.3k 352 1.8k4.6k
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Language:Python36.3k 418 2.2k5.2k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.8k 158 1.6k2.3k
meta-llama/codellama
Inference code for CodeLlama models
Language:Python16.1k 187 2071.9k
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Language:Python15.4k 83 3.8k1.8k
triton-lang/triton
Development repository for the Triton language and compiler
Language:C++13.8k 196 1.6k1.7k
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Language:Python13.6k 99 7871.6k
apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python11.9k 377 3.4k3.5k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.9k 166 8052.4k
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language:Python9.5k 103 1.4k1.1k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++9k 96 2.1k1k
huggingface/chat-ui
Open source codebase powering the HuggingChat app
Language:TypeScript7.8k 83 6101.2k
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language:Python6.7k 65 84374
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language:Python4.9k 57 235426
huggingface/safetensors
Simple, safe way to store and distribute tensors
Language:Python3k 44 188200
BBuf/tvm_mlir_learn
compiler learning resources collect.
Language:Python2.2k 36 4338
fangwei123456/spikingjelly
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
Language:Python1.4k 21 421250
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
Language:Python1.4k 41 499
ChenyangQiQi/FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
Language:Jupyter Notebook1.1k 14 35107
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Language:Python1k 37 2041
sturdy-dev/codereview.gpt
Reviews your Pull/Merge Requests using ChatGPT
Language:JavaScript561 10 2369
ZiyuGuo99/Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
Language:Python423 15 1231
meta-math/MetaMath
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Language:Python397 7 2836
PJLab-ADG/OpenPCSeg
OpenPCSeg: Open Source Point Cloud Segmentation Toolbox and Benchmark
Language:Python380 12 2736
lucidrains/segformer-pytorch
Implementation of Segformer, Attention + MLP neural network for segmentation, in Pytorch
Language:Python345 9 1343
Alibaba-NLP/SeqGPT
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
Language:Python215 4 1411
AlibabaResearch/flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
Language:Cuda186 5 817
zhenyuw16/Uni3DETR
Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation"
Language:Python88 6 123
AlibabaResearch/recom
An Optimizing Compiler for Recommendation Model Inference
Language:C++22 4 12
luiyen/llm-code-review
A container GitHub Action to review a pull request by HuggingFace's LLM Model.
Language:Python22 1 09

Dinghow

Dinghow's Stars

lm-sys/FastChat

microsoft/autogen

haotian-liu/LLaVA

meta-llama/codellama

BerriAI/litellm

triton-lang/triton

THUDM/ChatGLM3

apache/tvm

NVIDIA/Megatron-LM

huggingface/text-generation-inference

NVIDIA/TensorRT-LLM

huggingface/chat-ui

mit-han-lab/streaming-llm

lucidrains/x-transformers

huggingface/safetensors

BBuf/tvm_mlir_learn

fangwei123456/spikingjelly

horseee/Awesome-Efficient-LLM

ChenyangQiQi/FateZero

Xwin-LM/Xwin-LM

sturdy-dev/codereview.gpt

ZiyuGuo99/Point-Bind_Point-LLM

meta-math/MetaMath

PJLab-ADG/OpenPCSeg

lucidrains/segformer-pytorch

Alibaba-NLP/SeqGPT

AlibabaResearch/flash-llm

zhenyuw16/Uni3DETR

AlibabaResearch/recom

luiyen/llm-code-review