Pinned Repositories
3D-Machine-Learning
A resource repository for 3D machine learning
academic-kickstart
accel-sim-framework
This is the top-level repository for the Accel-Sim framework.
algorithms
Algorithms & Data structures in C++.
algorithms_and_data_structures
160+ Algorithm & Data Structure Problems using C++
DissectingTensorCores
ECA-GPU2022
sunlex0717's Repositories
sunlex0717/DissectingTensorCores
sunlex0717/ECA-GPU2022
sunlex0717/accel-sim-framework
This is the top-level repository for the Accel-Sim framework.
sunlex0717/Awesome-Pruning
A curated list of neural network pruning resources.
sunlex0717/awesome-semantic-segmentation
🤘 awesome-semantic-segmentation
sunlex0717/bitfusion
Simulator for BitFusion
sunlex0717/CMix-NN
CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices
sunlex0717/CppCoreGuidelines
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
sunlex0717/cutlass
CUDA Templates for Linear Algebra Subroutines
sunlex0717/distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://nervanasystems.github.io/distiller
sunlex0717/External-Attention-pytorch
🍀 PyTorch implementations of various attention mechanisms, MLP, re-parameterization, and convolution modules, helpful for understanding the underlying papers. ⭐⭐⭐
sunlex0717/flash-attention
Fast and memory-efficient exact attention
sunlex0717/free-programming-books
📚 Freely available programming books
sunlex0717/gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVision, and an integrated energy model, GPUWattch.
sunlex0717/gpu-arch-microbenchmark
Dissecting NVIDIA GPU Architecture
sunlex0717/incubator-tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
sunlex0717/interview
📚 A summary of C/C++ interview knowledge
sunlex0717/metal-cpp-cmake
Metal graphics API ported to C++ with a CMake build system
sunlex0717/MinkowskiEngine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
sunlex0717/mmdetection
Open MMLab Detection Toolbox and Benchmark
sunlex0717/muduo
Event-driven network library for multi-threaded Linux server in C++11
sunlex0717/netron
Visualizer for neural network, deep learning and machine learning models
sunlex0717/pdfs
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
sunlex0717/Python-Books
sunlex0717/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
sunlex0717/stanford-compilers-coursework
Coursework projects for the Stanford Compilers MOOC. The result is a fully working compiler for the COOL programming language.
sunlex0717/sunlex0717.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
sunlex0717/timeloop
Timeloop performs modeling, mapping and code-generation for Tensor Algebra workloads running on Explicitly-Decoupled Data Orchestration (EDDO) architectures.
sunlex0717/torch2trt
An easy-to-use PyTorch to TensorRT converter
sunlex0717/triSYCL
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group