xingjinglu's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
CompVis/stable-diffusion
A latent text-to-image diffusion model
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.
OAI/OpenAPI-Specification
The OpenAPI Specification Repository
facebook/folly
An open-source C++ library developed and used at Facebook.
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
triton-lang/triton
Development repository for the Triton language and compiler
temporalio/temporal
Temporal service
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
gperftools/gperftools
Main gperftools repository
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
sovrasov/flops-counter.pytorch
FLOPs counter for convolutional networks in the PyTorch framework
herumi/xbyak
A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
skarupke/flat_hash_map
A very fast hashtable
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
python/pyperformance
Python Performance Benchmark Suite
stochasticai/x-stable-diffusion
Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord community: https://discord.com/invite/TgHXuSJEk6
ROCm/composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
amirgholami/ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
kssteven418/I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
WoosukKwon/retraining-free-pruning
[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers
zengkid/pdf-books
:books: PDF book library
astojanov/Clover
Clover: Quantized 4-bit Linear Algebra Library
kssteven418/LTP
[KDD'22] Learned Token Pruning for Transformers
renzibei/fph-table
Flash Perfect Hash Table: an implementation of a dynamic perfect hash table, extremely fast for lookups
masahi/tvm-winograd
Test Winograd convolution written in TVM for CUDA and AMDGPU
kssteven418/Q-ASR
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition