tgale96

ME, USA

Pinned Repositories

megablocks
Language:Python1.2k 19 53169
sputnik
A library of GPU kernels for sparse matrix operations.
Language:C++241 10 850
stk
Language:Python85 3 618
convolutions
Hacked up convolution kernels
Language:Cuda0 1 00
cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++0 0 00
cutlass-issue
Language:Cuda0 1 00
grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
Language:Cuda44 2 936
memory-address-trace-tools
Language:Python1 1 00
quark
A miniature framework for numeric computation on GPU
Language:C++1 1 00
tum-rgbd-associate
Scripts to do three-way data association, build hdf5 datasets, and improved two way data association for the TUM RGBD dataset
Language:Python2 1 00

tgale96's Repositories

tgale96/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
Language:Cuda44 2 936
tgale96/tum-rgbd-associate
Scripts to do three-way data association, build hdf5 datasets, and improved two way data association for the TUM RGBD dataset
Language:Python2 1 00
tgale96/memory-address-trace-tools
Language:Python1 1 00
tgale96/quark
A miniature framework for numeric computation on GPU
Language:C++1 1 00
tgale96/sputnik
Language:Cuda1 2 00
tgale96/convolutions
Hacked up convolution kernels
Language:Cuda0 1 00
tgale96/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++0 0 00
tgale96/cutlass-issue
Language:Cuda0 1 00
tgale96/deepslam
Language:Python00
tgale96/EfficientNet-PyTorch
A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)
Language:Python0 0
tgale96/examples
Fast and flexible reference benchmarks
Language:Python0 0
tgale96/gemm
Experiments with GEMM
Language:C++1 0
tgale96/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
Language:Python0 0
tgale96/Megatron-LM
A formerly private fork of Megatron-LM
Language:Python2 0
tgale96/tarfile-cpp
poc creating tarfile & reading contents one by one
Language:C++1 0