Pinned Repositories
megablocks
sputnik
A library of GPU kernels for sparse matrix operations.
stk
convolutions
Hacked up convolution kernels
cutlass
CUDA Templates for Linear Algebra Subroutines
cutlass-issue
grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
memory-address-trace-tools
quark
A miniature framework for numeric computation on GPU
tum-rgbd-associate
Scripts to do three-way data association, build hdf5 datasets, and improved two way data association for the TUM RGBD dataset
tgale96's Repositories
tgale96/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
tgale96/tum-rgbd-associate
Scripts to do three-way data association, build hdf5 datasets, and improved two way data association for the TUM RGBD dataset
tgale96/memory-address-trace-tools
tgale96/quark
A miniature framework for numeric computation on GPU
tgale96/sputnik
tgale96/convolutions
Hacked up convolution kernels
tgale96/cutlass
CUDA Templates for Linear Algebra Subroutines
tgale96/cutlass-issue
tgale96/deepslam
tgale96/EfficientNet-PyTorch
A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)
tgale96/examples
Fast and flexible reference benchmarks
tgale96/gemm
Experiments with GEMM
tgale96/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
tgale96/Megatron-LM
A formerly private fork of Megatron-LM
tgale96/tarfile-cpp
poc creating tarfile & reading contents one by one