/ParallelComputing

UCSD FA22 CSE260 Course Project

Primary LanguageJupyter Notebook

ParallelComputing

UCSD FA22 CSE260 Course Programming Assignments

PA1 - GEMM by Blas & SIMD

Implemented GeMM(General Matrix Multiplication) by blocking algorithm Blas. Accelerated by SIMD on an ARM-64 architecture. Performance can be compared with blas method.

d

PA2 - GEMM by GPU

Implemented GeMM by CuBlas blocking algorithm accelerated by GPU.

d

PA3 - Aliev-Panfilov Model by MPI

Implemented simulation of Aliev-Panfilov model on a super machine. Communication done by MPI.

d