Pinned Repositories
ANNS_Reproduce
A Comprehensive Survey and Experimental Comparison of Graph-based Approximate Nearest Neighbor Search
Berti-Artifact
An artifact for Berti: an Accurate and Timely Local-Delta Data Prefetcher
Computer-Architecture-Learning
cry-daniel.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
CudaGemmDIY
DIY a matrix multiplication with cuda and tensor core, from easy understanding to run fast
dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
FusedMM
Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"
FusedMM4DGL
GANNS
Article: GPU-accelerated Proximity Graph Approximate Nearest Neighbor Search and Construction by Authors Yuanhang Yu, Dong Wen, Ying Zhang, Lu Qin, Wenjie Zhang and Xuemin Lin
cry-daniel's Repositories
cry-daniel/Computer-Architecture-Learning
cry-daniel/GANNS
Article: GPU-accelerated Proximity Graph Approximate Nearest Neighbor Search and Construction by Authors Yuanhang Yu, Dong Wen, Ying Zhang, Lu Qin, Wenjie Zhang and Xuemin Lin
cry-daniel/ANNS_Reproduce
A Comprehensive Survey and Experimental Comparison of Graph-based Approximate Nearest Neighbor Search
cry-daniel/Berti-Artifact
An artifact for Berti: an Accurate and Timely Local-Delta Data Prefetcher
cry-daniel/cry-daniel.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
cry-daniel/cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
cry-daniel/CudaGemmDIY
DIY a matrix multiplication with cuda and tensor core, from easy understanding to run fast
cry-daniel/dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
cry-daniel/FusedMM
Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"
cry-daniel/FusedMM4DGL
cry-daniel/GKAT
cry-daniel/ogb
Benchmark datasets, data loaders, and evaluators for graph machine learning
cry-daniel/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
cry-daniel/reproduce-cgo2017-paper
Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.
cry-daniel/sfvsfv.github.io
ChatGPt国内镜像版,项目源码和使用教程。
cry-daniel/pytorch_sparse
PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations
cry-daniel/QGTC_PPoPP22
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
cry-daniel/rnn-descent