Lurkrazy

High-Performance Computing

University of UtahSalt Lake City

Pinned Repositories

Ansor-AF-DS
This repository contains the figures, tables and source code in the ICS'24 paper: "Accelerated Auto-Tuning of GPU Kernels for Tensor Computations".
Language:Python50
1point3acres
一亩三分地论坛自动签到、答题
Language:Python0 0 00
ASPLOS_artifact
Language:C0 0 00
Auto-Tuning
An auto-Tuning script for OpenBLAS
Language:Python0 2 00
auto_feed_js
PT站一键转载脚本
Language:JavaScript0 1 00
awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
00
Compiler-experiment
编译原理实验内容，包括词法分析器、递归下降法和预测分析法的语法分析器。使用C++编写
Language:C++57 1 028
libxsmm-and-JIT
some notes of libxsmm and JIT
1 2 00
mtx-Col-to-Row
mtx file Col-major to Row-major
Language:Python1 1 00
OpenBLAS_Kunpeng
This is a fake OpenBLAS. We are going to add some BLAS-like extension to it.
Language:Fortran00

Lurkrazy's Repositories

Lurkrazy/mtx-Col-to-Row
mtx file Col-major to Row-major
Language:Python1 1 00
Lurkrazy/1point3acres
一亩三分地论坛自动签到、答题
Language:Python0 0 00
Lurkrazy/ASPLOS_artifact
Language:C0 0 00
Lurkrazy/awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
00
Lurkrazy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
0 0 00
Lurkrazy/Beijing-IPTV
最好用的北京联通IPTV频道列表。https://bjiptv.eu.org/
Language:HTML0 0 00
Lurkrazy/OpenBLAS-merge
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
Language:C0 1 00
Lurkrazy/chatgpt-html
chatgpt html online
Language:CSS0 0
Lurkrazy/copy-translator
简单、轻量、好用的划词翻译软件
Language:Rust0 0
Lurkrazy/cuda_sgemm
Language:Cuda0 0
Lurkrazy/EOP
VEE 22: Efficient Operator Partition for Deep Learning Inference Over Edge Servers
Language:Python0 0
Lurkrazy/frps-onekey
Language:Shell0 0
Lurkrazy/go-shadowsocks2
Modern Shadowsocks in Go
Language:Go0 0
Lurkrazy/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language:Cuda0 0
Lurkrazy/interview-english
English for Tech Interview 面试中的英语
0 0
Lurkrazy/LibShalom
Language:C0 0
Lurkrazy/lurkrazy.github.io
A jekyll based resume template
Language:HTML0 01
Lurkrazy/models
Model Zoo for Intel® Architecture: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors
Language:Python0 0
Lurkrazy/MYLIB
Language:C0 0
Lurkrazy/myTLCBench
Language:Python
Lurkrazy/OpBench
based on TVM. profiling op performance with many features.
Language:Python0 0
Lurkrazy/Optimizing-SGEMM-on-NVIDIA-Turing-GPUs
Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.
Language:Cuda0 0
Lurkrazy/p1a3_script
Tampermonkey Script for 1point3acres / 一亩三分地的油猴脚本
Language:CSS0 0
Lurkrazy/PyDTNN
PyDTNN - Python Distributed Training of Neural Networks
Language:Python0 0
Lurkrazy/removed-2022-07-12
0 0
Lurkrazy/testCuda
test cuda environment
Language:Cuda1 0
Lurkrazy/TLCBench
Benchmark scripts for TVM
Language:Python0 0
Lurkrazy/Traduzir-paginas-web
Translate your page in real time using Google or Yandex
Language:JavaScript0 0
Lurkrazy/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Lurkrazy/wowchemy-hugo-themes
🔥 Hugo website builder, Hugo themes & Hugo CMS. No code, build with widgets! 创建在线课程，学术简历或初创网站。
Language:SCSS0 0