Dinghow's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
meta-llama/codellama
Inference code for CodeLlama models
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
triton-lang/triton
Development repository for the Triton language and compiler
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
huggingface/chat-ui
Open source codebase powering the HuggingChat app
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
huggingface/safetensors
Simple, safe way to store and distribute tensors
BBuf/tvm_mlir_learn
compiler learning resources collect.
fangwei123456/spikingjelly
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
ChenyangQiQi/FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
sturdy-dev/codereview.gpt
Reviews your Pull/Merge Requests using ChatGPT
ZiyuGuo99/Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
meta-math/MetaMath
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
PJLab-ADG/OpenPCSeg
OpenPCSeg: Open Source Point Cloud Segmentation Toolbox and Benchmark
lucidrains/segformer-pytorch
Implementation of Segformer, Attention + MLP neural network for segmentation, in Pytorch
Alibaba-NLP/SeqGPT
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
AlibabaResearch/flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
zhenyuw16/Uni3DETR
Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation"
AlibabaResearch/recom
An Optimizing Compiler for Recommendation Model Inference
luiyen/llm-code-review
A container GitHub Action to review a pull request by HuggingFace's LLM Model.