Dinghow's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
mozilla/sccache
Sccache is a ccache-like tool. It is used as a compiler wrapper and avoids compilation when possible, caching results either locally or in remote storage, including various cloud storage backends.
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
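As a rough illustration of the 4-bit group-wise weight format that AWQ targets, the toy sketch below quantizes and dequantizes a weight vector with one scale per group. Note this omits the activation-aware scale search that is AWQ's actual contribution; all names and values are illustrative.

```python
def quantize_4bit(w, group_size=8):
    """Quantize a flat list of floats to int4 values, one scale per group."""
    groups = [w[i:i + group_size] for i in range(0, len(w), group_size)]
    q, scales = [], []
    for g in groups:
        # symmetric scale so the largest magnitude maps to the int4 range -8..7
        scale = max(abs(x) for x in g) / 7 or 1.0
        scales.append(scale)
        q.append([max(-8, min(7, round(x / scale))) for x in g])
    return q, scales

def dequantize_4bit(q, scales):
    """Reconstruct approximate float weights from int4 values and scales."""
    return [v * s for row, s in zip(q, scales) for v in row]

w = [0.9, -1.3, 0.2, 2.1, -0.4, 0.05, -2.4, 1.0,
     0.3, -0.7, 1.8, -1.1, 0.6, -0.2, 0.9, -1.5]
q, s = quantize_4bit(w)
w_hat = dequantize_4bit(q, s)
# per-element error is bounded by half a quantization step (scale / 2)
err = max(abs(a - b) for a, b in zip(w, w_hat))
```

The per-group scale keeps the reconstruction error bounded by half a step of that group's scale, which is why group-wise formats lose much less accuracy than a single per-tensor scale.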
xusenlinzy/api-for-open-llm
An OpenAI-style API for open large language models, letting you use LLMs just like ChatGPT. Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large language models.
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
Xmader/aria-ng-gui
An Aria2 GUI client for Windows, Linux & macOS
Thinklab-SJTU/Bench2Drive
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
szymanowiczs/splatter-image
Official implementation of "Splatter Image: Ultra-Fast Single-View 3D Reconstruction" (CVPR 2024)
microsoft/MInference
[NeurIPS'24 Spotlight] MInference speeds up long-context LLM inference with approximate, dynamic sparse attention computation, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
hemingkx/SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
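The draft-and-verify loop these papers study can be sketched with toy stand-in models. Both "models" below are hypothetical fixed distributions over a tiny vocabulary, not real LLMs; in practice the draft model is a small cheap LLM and the target model is the large one being accelerated.

```python
import random

random.seed(0)

VOCAB = ["a", "b", "c"]

def draft_model(prefix):
    # cheap, approximate next-token distribution (placeholder)
    return {"a": 0.6, "b": 0.3, "c": 0.1}

def target_model(prefix):
    # expensive, authoritative next-token distribution (placeholder)
    return {"a": 0.5, "b": 0.4, "c": 0.1}

def speculative_step(prefix, k=4):
    """Draft k tokens cheaply, then accept/reject them against the target.

    Uses the standard rejection rule: accept a drafted token t with
    probability min(1, p_target(t) / p_draft(t)).
    """
    drafted = []
    for _ in range(k):
        p = draft_model(prefix + drafted)
        t = random.choices(VOCAB, weights=[p[v] for v in VOCAB])[0]
        drafted.append(t)

    accepted = []
    for t in drafted:
        q = target_model(prefix + accepted)[t]
        p = draft_model(prefix + accepted)[t]
        if random.random() < min(1.0, q / p):
            accepted.append(t)
        else:
            break  # first rejection ends the speculative run
    return accepted

out = speculative_step([])
```

The payoff is that the target model can verify all k drafted tokens in one batched forward pass, so every accepted token beyond the first saves a full target-model decoding step.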
xingyaoww/code-act
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
mit-han-lab/qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
FloridSleeves/LLMDebugger
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
LLMServe/DistServe
Disaggregated serving system for Large Language Models (LLMs).
HKUNLP/ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
OpenDriveLab/ELM
[ECCV 2024] Embodied Understanding of Driving Scenarios
zyc00/Point-SAM
The official repository of "Point-SAM: Promptable 3D Segmentation Model for Point Clouds". Provides code for running the demo and links to download checkpoints.
Glaciohound/LM-Infinite
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
zhihao-lin/3dgcn
Convolution in the Cloud: Learning Deformable Kernels in 3D Graph Convolution Networks for Point Cloud Analysis
OpenRobotLab/Grounded_3D-LLM
Code&Data for Grounded 3D-LLM with Referent Tokens
linhaojia13/PointMetaBase
A PyTorch implementation of PointMetaBase, proposed in the paper "Meta Architecture for Point Cloud Analysis"
AlibabaPAI/FLASHNN
alibaba/llm-scheduling-artifact
Artifact of the OSDI '24 paper "Llumnix: Dynamic Scheduling for Large Language Model Serving"
xmed-lab/DIF-Gaussian
MICCAI 2024: Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction
PanZaifeng/RecFlex
A kernel-optimization system for recommendation models