The assignments of Foundation Level Programming for High-Performance Computing (DCS295)
- lab1: Performance analysis and optimization using - perf
- lab2: Multithreaded matrix vector multiplication - pthread
- lab3: Thread synchronization
- lab4: MPI - Trapezoidal Numerical Integration && Parallel Sorting by Regular Sampling (PSRS)
- lab5: Parallel matrix multiplication with MPI+OpenMP - cannon, dns
- lab6: CUDA - simple matrix addition
- lab7: CUDA - matrix multiplication
- lab8: Optimize GPU matrix multiplication
- final project: list ranking + Sparse matrix multiplication