sgxu's Stars
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
weiliu89/caffe
Caffe: a fast open framework for deep learning.
kpzhang93/MTCNN_face_detection_alignment
Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks
bshoshany/thread-pool
BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library
fyu/dilation
Dilated Convolution for Semantic Image Segmentation
tpoisonooo/how-to-optimize-gemm
row-major matmul optimization
daadaada/turingas
Assembler for NVIDIA Volta and Turing GPUs
778477/iOS-LinkMapAnalyzer
解析iOS工程中的linkmap文件,方便分析各个模块占用的包大小
GVProf/GVProf
GVProf: A Value Profiler for GPU-based Clusters
mark1879/DSA
Data Structures & Algorithms implemented by c++
ryanolson/ansible-nvidia-docker
Ansible role to install nvidia-docker
kpzhang93/kpzhang93.github.io