wm901115nwpu's Stars
google/styleguide
Style guides for Google-originated open-source projects
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
rocky/python-uncompyle6
A cross-version Python bytecode decompiler
facebook/buck2
Build system, successor to Buck
zrax/pycdc
A Python bytecode disassembler and decompiler written in C++
sgl-project/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
huggingface/safetensors
Simple, safe way to store and distribute tensors
google/maxtext
A simple, performant and scalable Jax LLM!
godweiyang/NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
pytorch/torchtitan
A native PyTorch Library for large model training
databricks/megablocks
rocky/python-decompile3
Python decompiler for 3.7-3.8. Stripped down from uncompyle6 so we can refactor and start to fix up some long-standing problems
bat67/pytorch-tutorials-examples-and-books
PyTorch tutorials, examples, and some books I found; a curated collection of up-to-date PyTorch tutorials, examples, and books (updated irregularly)
NVIDIA/cccl
CUDA C++ Core Libraries
mustvlad/ChatGPT-System-Prompts
This repository contains a collection of the best system prompts for ChatGPT, a conversational AI model developed by OpenAI. Star this repository to help us reach 5,000 stars!
codecaution/Awesome-Mixture-of-Experts-Papers
A curated reading list of research on Mixture-of-Experts (MoE).
IST-DASLab/marlin
An FP16xINT4 LLM inference kernel that achieves near-ideal ~4x speedups at batch sizes of up to 16-32 tokens.
BobaZooba/xllm
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
sail-sg/zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
microsoft/triton-shared
Shared Middle-Layer for Triton Compilation
hpcaitech/TensorNVMe
A Python library that transfers PyTorch tensors between CPU and NVMe
ROCm/triton
Development repository for the Triton language and compiler
Jokeren/GPA
GPU Performance Advisor
GVProf/GVProf
GVProf: A Value Profiler for GPU-based Clusters
ModelTC/TFMQ-DM
[CVPR 2024 Highlight] TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
Hzfengsy/asplos-tvm