Pinned Repositories
0xDeCA10B
Decentralized & Collaborative AI on Blockchain
benchmark
book
caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
cloud
PaddleCloud distributed training job scheduling
code_samples
cudnn_conv_int8
Testing INT8 convolution on cuDNN
CycleGAN
Tensorflow implementation of CycleGAN
dist_inf
Paddle
PArallel Distributed Deep LEarning
wanghaoshuang's Repositories
wanghaoshuang/Paddle
PArallel Distributed Deep LEarning
wanghaoshuang/cudnn_conv_int8
Testing INT8 convolution on cuDNN
wanghaoshuang/0xDeCA10B
Decentralized & Collaborative AI on Blockchain
wanghaoshuang/benchmark
wanghaoshuang/code_samples
wanghaoshuang/dist_inf
wanghaoshuang/FasterTransformer
Transformer related optimization, including BERT, GPT
wanghaoshuang/FlashMLA
wanghaoshuang/FleetX
Paddle Distributed Training Examples. 飞桨分布式训练示例 Resnet Bert GPT MOE DataParallel ModelParallel PipelineParallel HybridParallel AutoParallel Zero Sharding Recompute GradientMerge Offload AMP DGC LocalSGD Wide&Deep
wanghaoshuang/FluidDoc
wanghaoshuang/graphs
wanghaoshuang/insightface
Face Analysis Project on MXNet
wanghaoshuang/models
Model configurations
wanghaoshuang/nccl-tests
NCCL Tests
wanghaoshuang/paddle-ce-latest-kpis
Paddle Continuous Evaluation, keep updating.
wanghaoshuang/Paddle-Lite
Multi-platform high performance deep learning inference engine (『飞桨』多平台高性能深度学习预测引擎)
wanghaoshuang/PaddleClas
A treasure chest for image classification powered by PaddlePaddle
wanghaoshuang/PaddleDetection
A high performance object detection toolkit based on PaddlePaddle.
wanghaoshuang/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
wanghaoshuang/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (multilingual recognition: English, Chinese, Korean, Japanese, German, French etc. 3.5M practical ultra lightweight OCR system, support training and deployment among server, mobile, embedded and IoT devices)
wanghaoshuang/PaddleSeg
A high performance semantic segmentation toolkit based on PaddlePaddle. (『飞桨』图像分割库)
wanghaoshuang/PaddleSlim
PaddleSlim is an open-source library for deep model compression and architecture search.
wanghaoshuang/PaddleX
PaddlePaddle End-to-End Development Toolkit(『飞桨』深度学习全流程开发工具)
wanghaoshuang/pytorch-cifar100
Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet)
wanghaoshuang/pytorch_resnet_cifar10
Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.
wanghaoshuang/SlimKernels
wanghaoshuang/tpu
Reference models and tools for Cloud TPUs.
wanghaoshuang/triton
Development repository for the Triton language and compiler
wanghaoshuang/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
wanghaoshuang/wanghaoshuang.github.io