Pinned Repositories
AI-benchmark
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
benchmarks
Benchmark code
blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
caffe
Caffe: a fast open framework for deep learning.
caffe-parallel
Cpp_Primer_Answers
《C++ Primer》第五版中文版习题答案
CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
Deep_learning
深度学习
Agoniii's Repositories
Agoniii/AI-benchmark
Agoniii/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Agoniii/benchmarks
Benchmark code
Agoniii/blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Agoniii/caffe
Caffe: a fast open framework for deep learning.
Agoniii/caffe-parallel
Agoniii/Cpp_Primer_Answers
《C++ Primer》第五版中文版习题答案
Agoniii/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
Agoniii/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
Agoniii/Deep_learning
深度学习
Agoniii/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Agoniii/how-to-optimize-gemm
Agoniii/HPCInfo
Information about many aspects of high-performance computing. There is a lot of content in the Wiki.
Agoniii/hvd-test
Agoniii/llm_training_tools
Agoniii/Megatron-LM
Ongoing research training transformer models at scale
Agoniii/nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
Agoniii/NeMo
NeMo: a toolkit for conversational AI
Agoniii/NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
Agoniii/PipeCNN
An OpenCL-based FPGA Accelerator for Convolutinal Neural Networks
Agoniii/seastar
High performance server-side application framework
Agoniii/tensorboard
TensorFlow's Visualization Toolkit
Agoniii/tensorflow
An Open Source Machine Learning Framework for Everyone
Agoniii/tensorflow-101
learn code with tensorflow
Agoniii/tensorflow-tutorial
TensorFlow and Deep Learning Tutorials
Agoniii/tensorflow-zh
谷歌全新开源人工智能系统TensorFlow官方文档中文版
Agoniii/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Agoniii/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.