Pinned Repositories
arasgungore-CV
My curriculum vitae (CV) written using LaTeX.
BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
Chinavis2021-Prjsjtu
cutlass
CUDA Templates for Linear Algebra Subroutines
cyberkillor.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
FasterTransformer
Transformer related optimization, including BERT, GPT
FrontEnd-Learning
OCT-eyeDiseaseClassification
SJTU-OnlineJudge
cyberkillor's Repositories
cyberkillor/Chinavis2021-Prjsjtu
cyberkillor/OCT-eyeDiseaseClassification
cyberkillor/SJTU-OnlineJudge
cyberkillor/arasgungore-CV
My curriculum vitae (CV) written using LaTeX.
cyberkillor/BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
cyberkillor/ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
cyberkillor/cutlass
CUDA Templates for Linear Algebra Subroutines
cyberkillor/cyberkillor.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
cyberkillor/FasterTransformer
Transformer related optimization, including BERT, GPT
cyberkillor/FrontEnd-Learning
cyberkillor/Awesome-DL-Scheduling-Papers
cyberkillor/glake
GLake: optimizing GPU memory management and IO transmission.
cyberkillor/Image_Segmentation
Pytorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.
cyberkillor/ImageSegmentationEM
cyberkillor/llama.onnx
llama/alpaca onnx models, quantization and testcase
cyberkillor/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
cyberkillor/stablehlo
Backward compatible ML compute opset inspired by HLO/MHLO
cyberkillor/TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
cyberkillor/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
cyberkillor/xla
A community-driven and modular open source compiler for ML.