chenchu-zs

@AlibabaHangzhou, China

chenchu-zs's Stars

hydro-dev/Hydro
Hydro - Next generation high performance online-judge platform - 新一代高效强大的信息学在线测评系统 (a.k.a. vj5)
Language:TypeScript4.4k332
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python145k27.3k
jonas/tig
Text-mode interface for git
Language:C12.5k620
wolfpld/tracy
Frame profiler
Language:C++10.5k708
PaddlePaddle/Serving
A flexible, high-performance carrier for machine learning models（『飞桨』服务化部署框架）
Language:C++901251
borgwang/tinynn
A lightweight deep learning library
Language:Python37993
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language:Python39k4.3k
dpilger26/NumCpp
C++ implementation of the Python Numpy library
Language:C++3.6k554
facebookincubator/gloo
Collective communications library with various primitives for multi-machine training.
Language:C++1.2k304
hanickadot/compile-time-regular-expressions
Compile Time Regular Expression in C++
Language:C++3.4k188
mixmark-io/turndown
🛏 An HTML to Markdown converter written in JavaScript
Language:HTML9.1k888
apuaaChen/vectorSparse
Language:Cuda3212
microsoft/mimalloc
mimalloc is a compact general purpose allocator with excellent performance.
Language:C10.8k888
jemalloc/jemalloc
Language:C9.7k1.5k
google/tcmalloc
Language:C++4.5k486
ELS-RD/transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
Language:Python1.7k151
openai/openai-gemm
Open single and half precision gemm implementations
Language:C37385
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++5.9k1k
gyscos/cursive
A Text User Interface library for the Rust programming language
Language:Rust4.4k250
actor-framework/actor-framework
An Open Source Implementation of the Actor Model in C++
Language:C++3.2k546
microsoft/nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
Language:C++972162
bytedance/byteps
A high performance and generic framework for distributed DNN training
Language:Python3.6k491
deeperlearning/professional-cuda-c-programming
Language:Cuda399156
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36.1k4.2k
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Language:Python8.5k1.4k
herumi/xbyak
A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2
Language:C++2.1k276
ccache/ccache
ccache – a fast compiler cache
Language:C++2.4k506
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
Language:C++3.2k329
mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
Language:C++4.4k714
NixOS/patchelf
A small utility to modify the dynamic linker and RPATH of ELF executables
Language:C3.7k488

chenchu-zs

chenchu-zs's Stars

hydro-dev/Hydro

AUTOMATIC1111/stable-diffusion-webui

jonas/tig

wolfpld/tracy

PaddlePaddle/Serving

borgwang/tinynn

hpcaitech/ColossalAI

dpilger26/NumCpp

facebookincubator/gloo

hanickadot/compile-time-regular-expressions

mixmark-io/turndown

apuaaChen/vectorSparse

microsoft/mimalloc

jemalloc/jemalloc

google/tcmalloc

ELS-RD/transformer-deploy

openai/openai-gemm

NVIDIA/cutlass

gyscos/cursive

actor-framework/actor-framework

microsoft/nnfusion

bytedance/byteps

deeperlearning/professional-cuda-c-programming

microsoft/DeepSpeed

NVIDIA/apex

herumi/xbyak

ccache/ccache

bytedance/lightseq

mindspore-ai/mindspore

NixOS/patchelf