Pinned Repositories
AddressBook
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
ColossalAI
Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
DeepLearningExamples
Deep Learning Examples
differentNotice
diffusers
OneFlow fork of 🤗 Diffusers
djz
dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
dq-bart
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)
text-generation-inference
Large Language Model Text Generation Inference
dingjingzhen's Repositories
dingjingzhen/text-generation-inference
Large Language Model Text Generation Inference
dingjingzhen/AddressBook
dingjingzhen/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
dingjingzhen/ColossalAI
Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
dingjingzhen/DeepLearningExamples
Deep Learning Examples
dingjingzhen/differentNotice
dingjingzhen/diffusers
OneFlow fork of 🤗 Diffusers
dingjingzhen/djz
dingjingzhen/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
dingjingzhen/dq-bart
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)
dingjingzhen/DsbyLiteExample
dingjingzhen/EET
dingjingzhen/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
dingjingzhen/FasterTransformer
Transformer related optimization, including BERT, GPT
dingjingzhen/flash-attention
Fast and memory-efficient exact attention
dingjingzhen/GCD
dingjingzhen/JNeRF
JNeRF is a NeRF benchmark based on Jittor. JNeRF re-implemented instant-ngp and achieved same performance with original paper.
dingjingzhen/js-img-lunbo
dingjingzhen/mygit
dingjingzhen/navigation
navigationbar的各种问题
dingjingzhen/openresty-systemtap-toolkit
Real-time analysis and diagnostics tools for OpenResty (including NGINX, LuaJIT, ngx_lua, and more) based on SystemTap
dingjingzhen/RAPiD
RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images (CVPR 2020 Workshops)
dingjingzhen/smooth-sampler
Trilinear sampler with smoothstep and double backpropagation
dingjingzhen/stable-diffusion
A latent text-to-image diffusion model
dingjingzhen/TensorRt-8.4.0.6
dingjingzhen/test
测试用的
dingjingzhen/TPAT
TensorRT Plugin Autogen Tool
dingjingzhen/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
dingjingzhen/TreeTableview