This repository is not active
Rutvij-P/csc656-vecaddperf
In this assignment, I will write five implementations of vector addition: one for the CPU in C++ and four for the GPU in CUDA. I'll run each one, measure its runtime, compute the memory bandwidth it utilizes, and write a brief report containing my runtime data.
Cuda
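
The CPU baseline and its bandwidth calculation might look roughly like the sketch below. This is not the repository's actual code: the function names (`vec_add`, `bytes_moved`, `bandwidth_gbs`, `time_vec_add`), the `float` element type, and the assumption that each element costs three memory accesses (read `x`, read `y`, write `y`) are all hypothetical choices made for illustration.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <vector>

// Element-wise y[i] = x[i] + y[i] on the CPU (hypothetical baseline kernel).
void vec_add(const float* x, float* y, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        y[i] = x[i] + y[i];
    }
}

// Assumed memory traffic: each element is read from x, read from y,
// and written back to y, so 3 accesses of sizeof(float) bytes each.
double bytes_moved(std::size_t n) {
    return 3.0 * static_cast<double>(n) * static_cast<double>(sizeof(float));
}

// Achieved bandwidth in GB/s (1 GB = 1e9 bytes) for a run that took `seconds`.
double bandwidth_gbs(std::size_t n, double seconds) {
    return bytes_moved(n) / seconds / 1e9;
}

// Runs one vector addition of n elements and returns the elapsed wall time
// in seconds, measured with a monotonic clock around the kernel call only.
double time_vec_add(std::size_t n) {
    std::vector<float> x(n, 1.0f);
    std::vector<float> y(n, 2.0f);

    auto start = std::chrono::steady_clock::now();
    vec_add(x.data(), y.data(), n);
    auto stop = std::chrono::steady_clock::now();

    // Sanity check on the result; also keeps the loop from being optimized out.
    assert(n == 0 || y[n - 1] == 3.0f);

    return std::chrono::duration<double>(stop - start).count();
}
```

Calling `time_vec_add(n)` from `main` for some problem size `n` and passing the result to `bandwidth_gbs(n, seconds)` gives one runtime and bandwidth data point. Dividing that bandwidth by the machine's theoretical peak would give a utilization percentage, but the peak value depends on the hardware and is not stated here.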