Pinned Repositories
TLR-MDC
Tile-Low Rank Multi-Dimensional Convolution: fast MDC modelling and inversion for seismic applications
tlrmvm
ACSpGEMM
Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"
arrow-matrix
Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication
caffe
Caffe: a fast open framework for deep learning.
CANDMC
Communication Avoiding Numerical Dense Matrix Computations
ccls
C/C++/ObjC language server supporting cross references, hierarchies, completion and semantic highlighting
core_scheduler
CoreScheduler: A High-Performance Scheduler for Large Model Training
cub
THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
dgSPARSE-Lib
PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity
hongyx11's Repositories
hongyx11/caffe
Caffe: a fast open framework for deep learning.
hongyx11/CANDMC
Communication Avoiding Numerical Dense Matrix Computations
hongyx11/core_scheduler
CoreScheduler: A High-Performance Scheduler for Large Model Training
hongyx11/cub
THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
hongyx11/dplasma4dare
hongyx11/emacs.d
An Emacs configuration bundle with batteries included
hongyx11/fast_matrix_market
Fast and full-featured Matrix Market I/O library for C++, Python, and R
hongyx11/FunCoding
Leetcode, Codeforces, etc.
hongyx11/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce in detail how to optimize the program on the GPU. The reduce optimization has been completed. The optimization of GEMM has completed the CUDA C code. The assembler is currently being used to tune the code, and the code will be issued later.
hongyx11/HPC_connect
hongyx11/leetcode
leetcode records
hongyx11/libsvm
hongyx11/MLgeoscience
Teaching material for ML in Geoscience course
hongyx11/moao
MOAO simulation framework for manycore architectures.
hongyx11/mpi4py_tutorial
hongyx11/MRR_1D
code for MRR paper
hongyx11/numerical-linear-algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
hongyx11/parsec4dare
Parsec Impl for dare project
hongyx11/pylops
PyLops – A Linear-Operator Library for Python
hongyx11/stream-benchmark
hongyx11/templatebook
hongyx11/thrust
The C++ parallel algorithms library.
hongyx11/Tilematrix.jl
A tilematrix julia package that provide mixed precision features
hongyx11/TLR-MVM_Perf
Performance Records for TLR-MVM and TLR-MMM
hongyx11/vscode-cmake-tools
CMake integration in Visual Studio Code
hongyx11/zotero-better-notes
Everything about note management. All in Zotero.