Pinned Repositories
2048_Framework
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Apply_TD-learning_to_2048
cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
docs
Documentations for PaddlePaddle
GPU-Perf-Analyzer
A tool to classify and statistic GPU kernel information.
JAX-Toolbox
JAX-Toolbox
MIST-3D_printer-Trend-Analysis
models
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
mingxu1067's Repositories
mingxu1067/GPU-Perf-Analyzer
A tool to classify and statistic GPU kernel information.
mingxu1067/2048_Framework
mingxu1067/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
mingxu1067/Apply_TD-learning_to_2048
mingxu1067/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
mingxu1067/docs
Documentations for PaddlePaddle
mingxu1067/JAX-Toolbox
JAX-Toolbox
mingxu1067/MIST-3D_printer-Trend-Analysis
mingxu1067/models
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
mingxu1067/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
mingxu1067/paddle_allreduce_issues_reproduce
paddle_allreduce_issues_reproduce
mingxu1067/PaddleNLP
An NLP library with Awesome pre-trained Transformer models and easy-to-use interface, supporting wide-range of NLP tasks from research to industrial applications.
mingxu1067/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.