zheng-ningxin

http://zheng-ningxin.github.io/

zheng-ningxin's Stars

CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook68.9k 561 71710.2k
facebookresearch/llama
Inference code for LLaMA models
Language:Python50.9k 499 8728.7k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.5k 448 3155.1k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14.7k 122 1.2k1.4k
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Jupyter Notebook13.7k 296 8493.3k
git-lfs/git-lfs
Git extension for versioning large files
Language:Go13.1k 485 3.1k2.1k
alexjc/neural-enhance
Super Resolution for images using deep learning.
Language:Python11.9k 398 01.4k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.9k 166 8052.4k
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language:Python10.6k 159 65830
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Language:Python9.3k 113 190722
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Language:Python8.5k 100 1.2k1.4k
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++5.9k 63 625896
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++5.9k 109 1.2k1k
lucidrains/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Language:Python5.6k 97 277641
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
Language:C++3.2k 58 287329
microsoft/torchscale
Foundation Architecture for (M)LLMs
Language:Python3k 47 79211
microsoft/Cream
This is a collection of our NAS and Vision Transformer work.
Language:Python1.7k 34 170231
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
Language:Python1k 45 567124
facebookresearch/CompilerGym
Reinforcement learning environments for compiler and program optimization tasks
Language:Python917 34 291130
facebookresearch/LeViT
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
Language:Python605 11 2872
grimoire/mmdetection-to-tensorrt
convert mmdetection model to tensorrt, support fp16, int8, batch input, dynamic shape etc.
Language:Python593 14 10785
HarukiYqM/Non-Local-Sparse-Attention
PyTorch code for our paper "Image Super-Resolution with Non-Local Sparse Attention" (CVPR2021).
Language:Python175 3 3320
microsoft/SparTA
Language:Python132 10 1011
nicolaswilde/cuda-tensorcore-hgemm
Language:Cuda117 5 021
Karbo123/pytorch_grouped_gemm
High Performance Grouped GEMM in PyTorch
Language:Cuda23 1 22
tjyuyao/cutex
PyCUDA based PyTorch Extension Made Easy
Language:Python20 1 11
microsoft/Carbon-Insight
A platform to display the carbon neutralization information for researchers, decision-makers, and other participants in the community.
Language:Jupyter Notebook17 6 04
zheng-ningxin/Pruning-from-scratch
Language:Python17 3 13
sjtu-epcc/Laius
The source code of the paper"Laius: Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters" in ICS 2019.
Language:C++8 4 11
andygongyb/SparseTrain
SparseTrain: Leveraging Dynamic Sparsity in Training DNNs on General-Purpose SIMD Processors
Language:C++6 2 01

zheng-ningxin

zheng-ningxin's Stars

CompVis/stable-diffusion

facebookresearch/llama

Stability-AI/stablediffusion

Dao-AILab/flash-attention

NVIDIA/DeepLearningExamples

git-lfs/git-lfs

alexjc/neural-enhance

NVIDIA/Megatron-LM

RUCAIBox/LLMSurvey

nlpxucan/WizardLM

NVIDIA/apex

NVIDIA/FasterTransformer

NVIDIA/cutlass

lucidrains/DALLE-pytorch

bytedance/lightseq

microsoft/torchscale

microsoft/Cream

pytorch/torchdynamo

facebookresearch/CompilerGym

facebookresearch/LeViT

grimoire/mmdetection-to-tensorrt

HarukiYqM/Non-Local-Sparse-Attention

microsoft/SparTA

nicolaswilde/cuda-tensorcore-hgemm

Karbo123/pytorch_grouped_gemm

tjyuyao/cutex

microsoft/Carbon-Insight

zheng-ningxin/Pruning-from-scratch

sjtu-epcc/Laius

andygongyb/SparseTrain