Pinned Repositories
acceleratedCpp
adaptive-pruning
algorithm
cuda-ptx
Inline PTX Assembly in CUDA example
cuda_study
fast-conv
Fast Convolution Implementation via CUDA
ParallelMergeSort
Parallel Merge Sort Implemented via OpenMP
SpMM
Parallel Sparse Matrix Multiplication via CUDA
SYCL_easy_tutorial
Easy SYCL examples
tensor_core_test
jhson989's Repositories
jhson989/cuda-ptx
Inline PTX Assembly in CUDA example
jhson989/SpMM
Parallel Sparse Matrix Multiplication via CUDA
jhson989/fast-conv
Fast Convolution Implementation via CUDA
jhson989/SYCL_easy_tutorial
Easy SYCL examples
jhson989/ParallelMergeSort
Parallel Merge Sort Implemented via OpenMP
jhson989/adaptive-pruning
jhson989/analyse-cudnn-conv-fwd-algo
Analysis of cuDNN Convolution Forward Algorithms
jhson989/cppgo
An example of C++ code calling Go functions
jhson989/determinant_calculator
determinant_calculator
jhson989/fast-DBSCAN
Cython package for accelerated DBSCAN
jhson989/go-matmul-parallel
A tutorial on implementing concurrent Go methods through a matrix multiplication example (Parallelism beyond Concurrency)
jhson989/goServerAndClients
Go example code for calling an HTTP server and HTTP clients (at the same time)
jhson989/jhdnn
Custom DNN library via CUDA (with comparison examples against cuDNN)
jhson989/learn-qiskit-quantum-computing
Tutorial for learning quantum computing via Qiskit by IBM
jhson989/matmul_cublas
cuBLAS GEMM Example for FP32 MatMul
jhson989/nccl-matmul-tutorial
NCCL example for a matrix multiplication application in a single node
jhson989/ParallelBitonicSort
Parallel Bitonic Sort algorithm implementation with OpenMP
jhson989/ParallelScan
Parallel Scan Algorithm Implemented with OpenMP
jhson989/prune-by-distillation
jhson989/pycpp
Cython project for calling C++ routines from a Python program
jhson989/pySYCL
Python interface for SYCL
jhson989/pytorch-pruning-basic
PyTorch pruning tutorial
jhson989/Redux_CRUD
jhson989/spring-api
Java Spring Demo Project for an API server
jhson989/spring-book-collector
Book Collector made with Spring Framework
jhson989/sycl-fpga-devcloud
jhson989/SYCL-heterogeneous
CPU, GPU, and FPGA matrix multiplication examples via SYCL
jhson989/SYCL-primitives
Basic parallel primitives implemented using SYCL
jhson989/tf-to-trt
jhson989/web-react-mobx