Dinghow's Stars
ollama/ollama
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
xai-org/grok-1
Grok open release
state-spaces/mamba
Mamba SSM architecture
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
vosen/ZLUDA
CUDA on non-NVIDIA GPUs
QwenLM/Qwen2
Qwen2 is the large language model series developed by the Qwen team at Alibaba Cloud.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and a Chrome extension.
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama 3, Mistral, InternLM2, GPT-4, LLaMA 2, Qwen, GLM, Claude, etc.) over 100+ datasets.
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
project-baize/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or on consumer desktops
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
databricks/megablocks
NVIDIA/cuda-python
CUDA Python: Performance meets Productivity
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
bytedance/ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
SkunkworksAI/hydra-moe
InternLM/Agent-FLAN
[ACL 2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Q-Future/Q-Bench
① [ICLR 2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus + 16 open-source MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.
hemingkx/Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
thunlp/MoEfication
stanford-futuredata/stk
xmed-lab/C2RV-CBCT
CVPR 2024, "C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction"
ruyue0001/Backdoor_DPR
Code for "Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation"