Pinned Repositories
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DeepSpeedExamples
Example models using DeepSpeed
haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
stl1weekend
Build your own STL in one weekend
PDEBench
PDEBench: An Extensive Benchmark for Scientific Machine Learning
aima-python
Python implementation of algorithms from Russell and Norvig's "Artificial Intelligence - A Modern Approach"
annotated_deep_learning_paper_implementations
59 implementations/tutorials of deep learning papers with side-by-side notes, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), GANs (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), capsnet, distillation, ...
cumf_sgd
CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)
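For context on the technique, a minimal CUDA sketch of matrix-factorization SGD follows: each thread takes one observed rating (u, v, r) and applies a lock-free (Hogwild-style) update to the user and item factor vectors. Kernel and variable names here are illustrative assumptions, not cumf_sgd's actual API.

```cuda
#include <cstdio>

#define K 16  // latent factor dimension (illustrative choice)

// One Hogwild-style SGD step per observed rating (u, v, r):
//   e = r - p_u . q_v
//   p_u += lr * (e * q_v - lambda * p_u),  q_v updated symmetrically.
// Concurrent threads may race on shared rows; lock-free GPU MF-SGD
// approaches tolerate these benign races.
__global__ void sgd_update(const int* users, const int* items,
                           const float* ratings, int n_ratings,
                           float* P, float* Q, float lr, float lambda) {
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx >= n_ratings) return;
    float* p = P + users[idx] * K;
    float* q = Q + items[idx] * K;
    float pred = 0.f;
    for (int k = 0; k < K; ++k) pred += p[k] * q[k];
    float e = ratings[idx] - pred;
    for (int k = 0; k < K; ++k) {
        float pk = p[k], qk = q[k];  // use old values for both updates
        p[k] = pk + lr * (e * qk - lambda * pk);
        q[k] = qk + lr * (e * pk - lambda * qk);
    }
}

int main() {
    // Toy problem: 2 users, 2 items, 3 observed ratings.
    int h_u[3] = {0, 0, 1}, h_v[3] = {0, 1, 1};
    float h_r[3] = {5.f, 3.f, 4.f};
    float h_P[2 * K], h_Q[2 * K];
    for (int i = 0; i < 2 * K; ++i) { h_P[i] = 0.1f; h_Q[i] = 0.1f; }
    int *d_u, *d_v; float *d_r, *d_P, *d_Q;
    cudaMalloc(&d_u, sizeof(h_u)); cudaMalloc(&d_v, sizeof(h_v));
    cudaMalloc(&d_r, sizeof(h_r));
    cudaMalloc(&d_P, sizeof(h_P)); cudaMalloc(&d_Q, sizeof(h_Q));
    cudaMemcpy(d_u, h_u, sizeof(h_u), cudaMemcpyHostToDevice);
    cudaMemcpy(d_v, h_v, sizeof(h_v), cudaMemcpyHostToDevice);
    cudaMemcpy(d_r, h_r, sizeof(h_r), cudaMemcpyHostToDevice);
    cudaMemcpy(d_P, h_P, sizeof(h_P), cudaMemcpyHostToDevice);
    cudaMemcpy(d_Q, h_Q, sizeof(h_Q), cudaMemcpyHostToDevice);
    for (int epoch = 0; epoch < 100; ++epoch)
        sgd_update<<<1, 32>>>(d_u, d_v, d_r, 3, d_P, d_Q, 0.05f, 0.01f);
    cudaMemcpy(h_P, d_P, sizeof(h_P), cudaMemcpyDeviceToHost);
    cudaMemcpy(h_Q, d_Q, sizeof(h_Q), cudaMemcpyDeviceToHost);
    float pred = 0.f;
    for (int k = 0; k < K; ++k) pred += h_P[k] * h_Q[k];
    printf("predicted r(0,0) = %.2f (target 5.00)\n", pred);
    cudaFree(d_u); cudaFree(d_v); cudaFree(d_r);
    cudaFree(d_P); cudaFree(d_Q);
    return 0;
}
```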
How_to_optimize_in_GPU
A series of GPU optimization topics that explains, in detail, how to optimize CUDA kernels, covering several basic kernel optimizations (elementwise, reduce, sgemv, sgemm, etc.) whose performance is at or near the theoretical limit.
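As a taste of the "reduce" topic named above, here is a baseline shared-memory tree reduction in CUDA. The repository's tuned variants go well beyond this (warp shuffles, vectorized loads, multiple elements per thread), so treat this as a starting-point sketch rather than its final kernel.

```cuda
#include <cstdio>

// Block-level tree reduction in shared memory: each block sums 256
// elements, then the per-block partial sums are added on the host.
__global__ void reduce_sum(const float* in, float* out, int n) {
    __shared__ float s[256];
    int tid = threadIdx.x;
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    s[tid] = (i < n) ? in[i] : 0.f;
    __syncthreads();
    // Halve the active range each step; stride > warpSize steps need
    // the barrier, and the naive version keeps it throughout.
    for (int stride = blockDim.x / 2; stride > 0; stride >>= 1) {
        if (tid < stride) s[tid] += s[tid + stride];
        __syncthreads();
    }
    if (tid == 0) out[blockIdx.x] = s[0];
}

int main() {
    const int n = 1 << 20, threads = 256;
    const int blocks = (n + threads - 1) / threads;
    float *d_in, *d_out;
    cudaMalloc(&d_in, n * sizeof(float));
    cudaMalloc(&d_out, blocks * sizeof(float));
    // Fill input with ones so the expected sum is exactly n.
    float* h_in = new float[n];
    for (int i = 0; i < n; ++i) h_in[i] = 1.f;
    cudaMemcpy(d_in, h_in, n * sizeof(float), cudaMemcpyHostToDevice);
    reduce_sum<<<blocks, threads>>>(d_in, d_out, n);
    float* h_out = new float[blocks];
    cudaMemcpy(h_out, d_out, blocks * sizeof(float), cudaMemcpyDeviceToHost);
    double total = 0;
    for (int b = 0; b < blocks; ++b) total += h_out[b];
    printf("sum = %.0f (expected %d)\n", total, n);
    delete[] h_in; delete[] h_out;
    cudaFree(d_in); cudaFree(d_out);
    return 0;
}
```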
TurboBFS
A highly scalable GPU-based set of top-down and bottom-up BFS algorithms in the language of linear algebra.
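To illustrate what "BFS in the language of linear algebra" means, here is a minimal CUDA sketch of one top-down BFS level expressed as a masked sparse matrix-vector product over the boolean semiring. The kernel and data-structure names are assumptions for illustration, not TurboBFS's code.

```cuda
#include <cstdio>

// One level-synchronous top-down BFS step: the next frontier is
// next = A^T * frontier over the boolean semiring, masked by
// unvisited vertices. The CSR traversal below is that product:
// "multiply" = follow an edge, "add" = logical OR via first-visit CAS.
__global__ void bfs_spmv_step(const int* row_ptr, const int* col_idx,
                              const int* frontier, int* next,
                              int* levels, int n, int level) {
    int u = blockIdx.x * blockDim.x + threadIdx.x;
    if (u >= n || !frontier[u]) return;
    for (int e = row_ptr[u]; e < row_ptr[u + 1]; ++e) {
        int v = col_idx[e];
        // Visit v only if still unvisited (level == -1).
        if (atomicCAS(&levels[v], -1, level) == -1) next[v] = 1;
    }
}

int main() {
    // Undirected path graph 0-1-2-3 in CSR form; source vertex 0.
    int h_rp[5] = {0, 1, 3, 5, 6}, h_ci[6] = {1, 0, 2, 1, 3, 2};
    int h_front[4] = {1, 0, 0, 0}, h_lev[4] = {0, -1, -1, -1};
    int *d_rp, *d_ci, *d_front, *d_next, *d_lev;
    cudaMalloc(&d_rp, sizeof(h_rp)); cudaMalloc(&d_ci, sizeof(h_ci));
    cudaMalloc(&d_front, sizeof(h_front));
    cudaMalloc(&d_next, sizeof(h_front));
    cudaMalloc(&d_lev, sizeof(h_lev));
    cudaMemcpy(d_rp, h_rp, sizeof(h_rp), cudaMemcpyHostToDevice);
    cudaMemcpy(d_ci, h_ci, sizeof(h_ci), cudaMemcpyHostToDevice);
    cudaMemcpy(d_front, h_front, sizeof(h_front), cudaMemcpyHostToDevice);
    cudaMemcpy(d_lev, h_lev, sizeof(h_lev), cudaMemcpyHostToDevice);
    for (int level = 1; level < 4; ++level) {  // at most n-1 levels
        cudaMemset(d_next, 0, sizeof(h_front));
        bfs_spmv_step<<<1, 32>>>(d_rp, d_ci, d_front, d_next,
                                 d_lev, 4, level);
        int* tmp = d_front; d_front = d_next; d_next = tmp;
    }
    cudaMemcpy(h_lev, d_lev, sizeof(h_lev), cudaMemcpyDeviceToHost);
    for (int v = 0; v < 4; ++v) printf("level[%d] = %d\n", v, h_lev[v]);
    cudaFree(d_rp); cudaFree(d_ci); cudaFree(d_front);
    cudaFree(d_next); cudaFree(d_lev);
    return 0;
}
```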
qwerfdsadad's Repositories
qwerfdsadad/aima-python
Python implementation of algorithms from Russell and Norvig's "Artificial Intelligence - A Modern Approach"
qwerfdsadad/annotated_deep_learning_paper_implementations
59 implementations/tutorials of deep learning papers with side-by-side notes, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), GANs (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), capsnet, distillation, ...
qwerfdsadad/cumf_sgd
CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)
qwerfdsadad/How_to_optimize_in_GPU
A series of GPU optimization topics that explains, in detail, how to optimize CUDA kernels, covering several basic kernel optimizations (elementwise, reduce, sgemv, sgemm, etc.) whose performance is at or near the theoretical limit.
qwerfdsadad/TurboBFS
A highly scalable GPU-based set of top-down and bottom-up BFS algorithms in the language of linear algebra.