Pinned Repositories
AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
bazel-compile-commands-extractor
Goal: Enable awesome tooling for Bazel users of the C language family.
benchmark
A microbenchmark support library
cutlass
CUDA Templates for Linear Algebra Subroutines
deepmd-kit
A deep learning package for many-body potential energy representation and molecular dynamics
disc_tf_community
dssm
mlir
"Multi-Level Intermediate Representation" Compiler Infrastructure
scholar.py
A parser for Google Scholar, written in Python
tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
fwd4's Repositories
fwd4/dssm
fwd4/scholar.py
A parser for Google Scholar, written in Python
fwd4/AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
fwd4/bazel-compile-commands-extractor
Goal: Enable awesome tooling for Bazel users of the C language family.
fwd4/benchmark
A microbenchmark support library
fwd4/cutlass
CUDA Templates for Linear Algebra Subroutines
fwd4/deepmd-kit
A deep learning package for many-body potential energy representation and molecular dynamics
fwd4/disc_tf_community
fwd4/mlir
"Multi-Level Intermediate Representation" Compiler Infrastructure
fwd4/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
fwd4/BILIBILI-HELPER
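Automates Bilibili (B站) daily check-in and daily coin tipping, exchanges silver melon seeds for coins, claims premium-member benefits, uses the premium member's month-end charging on your own account, and more. Easily earn 65 experience points every day. Join me in reaching Lv6!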
fwd4/float-toy
Use this to build intuition for the IEEE floating-point format
fwd4/fwd4.github.io
fwd4/Linux-Device-Driver
Examples of Linux Device Drivers
fwd4/tensorflow
Computation using data flow graphs for scalable machine learning
fwd4/wincnn
Winograd minimal convolution algorithm generator for convolutional neural networks.