chenchu-zs's Stars
hydro-dev/Hydro
Hydro - Next-generation, high-performance online judge platform for informatics (a.k.a. vj5)
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
jonas/tig
Text-mode interface for git
wolfpld/tracy
Frame profiler
PaddlePaddle/Serving
A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)
borgwang/tinynn
A lightweight deep learning library
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
dpilger26/NumCpp
C++ implementation of the Python Numpy library
facebookincubator/gloo
Collective communications library with various primitives for multi-machine training.
hanickadot/compile-time-regular-expressions
Compile Time Regular Expression in C++
mixmark-io/turndown
🛏 An HTML to Markdown converter written in JavaScript
apuaaChen/vectorSparse
microsoft/mimalloc
mimalloc is a compact general purpose allocator with excellent performance.
jemalloc/jemalloc
google/tcmalloc
ELS-RD/transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
openai/openai-gemm
Open single and half precision gemm implementations
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
gyscos/cursive
A Text User Interface library for the Rust programming language
actor-framework/actor-framework
An Open Source Implementation of the Actor Model in C++
microsoft/nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.
bytedance/byteps
A high performance and generic framework for distributed DNN training
deeperlearning/professional-cuda-c-programming
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
herumi/xbyak
A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2
ccache/ccache
ccache – a fast compiler cache
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
NixOS/patchelf
A small utility to modify the dynamic linker and RPATH of ELF executables