Pinned Repositories
CMU-10-714
CMU 10-714: Deep Learning Systems
Compass_Optimizer
Compass Optimizer (OPT for short) is part of the Zhouyi Compass Neural Network Compiler. OPT converts the float Intermediate Representation (IR) generated by the Compass Unified Parser into an optimized quantized or mixed IR suited to Zhouyi NPU hardware platforms.
Compass_Unified_Parser
Arm China NPU parser
Competitive_Programming
WPLF template
MIT-6.031-Software-Construction
A record of my study of 6.031
MIT_6.5940
MIT open course on efficient ML
my-CS-road
py_tutorial
UCB-CS161-sp24
UCB-CS61c-2020summer
wplf's Repositories
wplf/MIT_6.5940
MIT open course on efficient ML
wplf/my-CS-road
wplf/py_tutorial
wplf/UCB-CS161-sp24
wplf/CMU-10-714
CMU 10-714: Deep Learning Systems
wplf/Compass_Optimizer
Compass Optimizer (OPT for short) is part of the Zhouyi Compass Neural Network Compiler. OPT converts the float Intermediate Representation (IR) generated by the Compass Unified Parser into an optimized quantized or mixed IR suited to Zhouyi NPU hardware platforms.
wplf/Compass_Unified_Parser
Arm China NPU parser
wplf/Competitive_Programming
WPLF template
wplf/cs-self-learning
A self-study guide to computer science
wplf/how-to-optimize-gemm
row-major matmul optimization
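As a minimal illustration of the kind of optimization this repo covers (a sketch of my own, not code from the repo): for row-major storage, reordering a matmul's loops from i-j-k to i-k-j makes the innermost loop scan B and C along contiguous rows, which is far friendlier to the cache. All names here are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Row-major square matmul in i-k-j loop order: the inner loop walks
 * B and C sequentially along rows, so accesses are cache-friendly.
 * Illustrative sketch only. */
static void matmul_ikj(size_t n, const double *A, const double *B, double *C)
{
    /* Zero the output first, since we accumulate into it. */
    for (size_t i = 0; i < n * n; i++)
        C[i] = 0.0;

    for (size_t i = 0; i < n; i++)
        for (size_t k = 0; k < n; k++) {
            double a = A[i * n + k];  /* hoisted: constant over the j loop */
            for (size_t j = 0; j < n; j++)
                C[i * n + j] += a * B[k * n + j];
        }
}
```

The same idea generalizes to the blocking and vectorization steps that tutorials like this one build up to.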
wplf/How_to_optimize_in_GPU
A series of GPU optimization topics covering how to optimize CUDA kernels in detail, including several basic kernels: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is at or near the theoretical limit.
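The reduce kernel mentioned above is typically built on a pairwise tree reduction. Here is a CPU-side sketch of that halving pattern (my own hypothetical code, run sequentially rather than across a CUDA thread block):

```c
#include <assert.h>
#include <stddef.h>

/* Pairwise tree reduction: the same stride-halving scheme a CUDA
 * reduce kernel applies within a block, executed sequentially here.
 * Reduces buf[0..n) in place; n must be a power of two. */
static double tree_reduce_sum(double *buf, size_t n)
{
    for (size_t stride = n / 2; stride > 0; stride /= 2)
        for (size_t i = 0; i < stride; i++)
            buf[i] += buf[i + stride];  /* combine with partner one stride away */
    return buf[0];
}
```

On a GPU the inner loop runs in parallel across threads with a synchronization barrier between strides; the arithmetic is identical.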
wplf/lzzplus2x
lzzkmc_wplf_changed
wplf/MIT-6.031-Software-Construction
A record of my study of 6.031
wplf/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (An online walkthrough for a certain massive game, full of dazzling arithmetic magic)
wplf/onnx
Open standard for machine learning interoperability
wplf/UCB-CS61c-2020summer
wplf/Megatron-LM
Ongoing research training transformer models at scale
wplf/mit-65840
wplf/tinyflow
Tutorial code on how to build your own Deep Learning System in 2k Lines
wplf/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
wplf/wplf
This is a special repository for my GitHub profile.
wplf/wplf.github.io