nirvedhmeshram

Pinned Repositories

cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++0 0 00
FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Language:C++0 0 00
five-letter-words
Experiments with Knuth's 5,757 five letter words.
Language:Python0 0 00
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python0 0 00
SHARK
Distributed SHARK
Language:Python1 0 00

nirvedhmeshram's Repositories

nirvedhmeshram/SHARK
Distributed SHARK
Language:Python1 0 00
nirvedhmeshram/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++0 0 00
nirvedhmeshram/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Language:C++0 0 00
nirvedhmeshram/five-letter-words
Experiments with Knuth's 5,757 five letter words.
Language:Python0 0 00
nirvedhmeshram/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python0 0 00
nirvedhmeshram/hbc_verification
Language:MLIR00
nirvedhmeshram/iree
👻
Language:C++1 0
nirvedhmeshram/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
Language:LLVM0 0
nirvedhmeshram/llvm-test-suite
Language:Logos0 0
nirvedhmeshram/mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
Language:C++0 0
nirvedhmeshram/PI
A lightweight MLIR Python frontend with support for PyTorch
Language:Python0 0
nirvedhmeshram/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python0 0
nirvedhmeshram/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Language:C++0 0