Pinned Repositories
cutlass_fpA_intB_gemm
libflash_attn
mxnet-cpp-inference
Test MXNet C++ API for doing inference, given a trained model
nnvm-vision-demo
Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM
tensorir-experiment
tf2-detection-to-tvm
torchscript-to-tvm
tvm-cutlass-eval
tvm-winograd
Test Winograd convolution written in TVM for CUDA and AMDGPU
masahi's Repositories
masahi/torchscript-to-tvm
masahi/tvm-cutlass-eval
masahi/libflash_attn
masahi/cutlass_fpA_intB_gemm
masahi/tensorir-experiment
masahi/tf2-detection-to-tvm
masahi/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
masahi/iwsec21_ntt
masahi/hb_tvm_example
masahi/models
Models and examples built with TensorFlow
masahi/ocaml_practice
masahi/pytorch-ssd
MobileNetV1-, MobileNetV2-, and VGG-based SSD/SSD-lite implementation in PyTorch 1.0 / PyTorch 0.4. Out-of-the-box support for retraining on the Open Images dataset. ONNX and Caffe2 support. Experimental ideas such as CoordConv.
masahi/pytorch_quantization
Test scripts for exploring PyTorch JIT and quantization capability
masahi/ci
Repository which handles configuration of TVM CI infrastructure.
masahi/cutlass
CUDA Templates for Linear Algebra Subroutines
masahi/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
masahi/Halide
A language for fast, portable data-parallel computation
masahi/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
masahi/int8_experiment
masahi/llmperf
LLMPerf is a library for validating and benchmarking LLMs
masahi/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
masahi/nttverify
Artifact for FLOPS 2022 paper
masahi/openmoonray
MoonRay is DreamWorks’ open-source, award-winning, state-of-the-art production MCRT renderer.
masahi/relax
Temporary repo for prototyping Relax (Relay Next); the effort will be upstreamed. We use the wiki pages on this repo to host design docs.
masahi/taichi
Productive programming language for portable, high-performance, sparse & differentiable computing
masahi/TLCBench
Benchmark scripts for TVM
masahi/triton
Development repository for the Triton language and compiler
masahi/tvm-rfcs
A home for the final text of all TVM RFCs.
masahi/vision
Datasets, Transforms and Models specific to Computer Vision
masahi/web-stable-diffusion
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.