Pinned Repositories
awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
BabelStream
STREAM, for lots of devices written in many programming models
benchmarks
A benchmark framework for TensorFlow
bert
TensorFlow code and pre-trained models for BERT
bots
The Barcelona OpenMP Task Suite
climate-seg-benchmark
Reference implementation for the climate segmentation benchmark, based on the Exascale Deep Learning for Climate Analytics work
CLOC
CL Offline Compiler: Compile OpenCL kernels to HSAIL
cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
DeepBench
Benchmarking Deep Learning operations on different hardware
hiptensorflow
ROCm/HIP-enabled TensorFlow.
sunway513's Repositories
sunway513/awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
sunway513/bert
TensorFlow code and pre-trained models for BERT
sunway513/benchmarks
A benchmark framework for TensorFlow
sunway513/climate-seg-benchmark
Reference implementation for the climate segmentation benchmark, based on the Exascale Deep Learning for Climate Analytics work
sunway513/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
sunway513/DeepLearningExamples
Deep Learning Examples
sunway513/deploy-code-server
sunway513/detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
sunway513/dlrm
An implementation of a deep learning recommendation model (DLRM)
sunway513/gloo
Collective communications library with various primitives for multi-machine training.
sunway513/HIP
HIP: Convert CUDA to Portable C++ Code
sunway513/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
sunway513/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
sunway513/misc
sunway513/nccl-rccl-parser
Tool to run rccl-tests/nccl-tests based on an application
sunway513/nmt
TensorFlow Neural Machine Translation Tutorial
sunway513/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
sunway513/rocBLAS
Next generation BLAS implementation for ROCm platform
sunway513/ROCm
ROCm - Open Source Platform for HPC and Ultrascale GPU Computing
sunway513/rocm-recipes
Recipes for ROCm
sunway513/ROCm.github.io
ROCm Website
sunway513/ROCm_Documentation
ROCm Software Platform Documentation
sunway513/ROCmValidationSuite
The ROCm Validation Suite is a system administrator's and cluster manager's tool for detecting and troubleshooting common problems affecting AMD GPU(s) running in a high-performance computing environment, enabled using the ROCm software stack on a compatible platform.
sunway513/Tensile
Stretching GPU performance for GEMMs and tensor contractions.
sunway513/tensorflow-large-model-support
Large Model Support in TensorFlow
sunway513/tensorflow-upstream
TensorFlow ROCm port
sunway513/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
sunway513/vision
Datasets, Transforms and Models specific to Computer Vision
sunway513/YOLOv3_TensorFlow
Complete YOLO v3 TensorFlow implementation. Support training on your own dataset.
sunway513/yolov5
YOLOv5 in PyTorch > ONNX > CoreML > TFLite