/csc656-vecaddperf

In this assignment, I will write 5 different implementations of a vector addition code: one for the CPU written in C++, and 4 different ones for GPU written in CUDA. I'll execute these codes, measure runtime and computer memory bandwidth utilized, and generate a brief report containing my runtime data.

Primary LanguageCuda

No issues in this repository yet.