zheng-ningxin's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
facebookresearch/llama
Inference code for LLaMA models
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
git-lfs/git-lfs
Git extension for versioning large files
alexjc/neural-enhance
Super Resolution for images using deep learning.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
lucidrains/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
microsoft/torchscale
Foundation Architecture for (M)LLMs
microsoft/Cream
This is a collection of our NAS and Vision Transformer work.
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
facebookresearch/CompilerGym
Reinforcement learning environments for compiler and program optimization tasks
facebookresearch/LeViT
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
grimoire/mmdetection-to-tensorrt
convert mmdetection model to tensorrt, support fp16, int8, batch input, dynamic shape etc.
HarukiYqM/Non-Local-Sparse-Attention
PyTorch code for our paper "Image Super-Resolution with Non-Local Sparse Attention" (CVPR2021).
microsoft/SparTA
nicolaswilde/cuda-tensorcore-hgemm
Karbo123/pytorch_grouped_gemm
High Performance Grouped GEMM in PyTorch
tjyuyao/cutex
PyCUDA based PyTorch Extension Made Easy
microsoft/Carbon-Insight
A platform to display the carbon neutralization information for researchers, decision-makers, and other participants in the community.
zheng-ningxin/Pruning-from-scratch
sjtu-epcc/Laius
The source code of the paper"Laius: Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters" in ICS 2019.
andygongyb/SparseTrain
SparseTrain: Leveraging Dynamic Sparsity in Training DNNs on General-Purpose SIMD Processors