A simple C++ program that adds two very large arrays on the GPU using CUDA
Run the following command to compile it
nvcc add.cu -o add_cuda
Run this command to execute it
./add_cuda
And run this command to profile (time) it
nvprof ./add_cuda
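
The contents of add.cu are not shown here, so the following is only a minimal sketch of what such a program might look like, not the actual source. It assumes unified memory (cudaMallocManaged), a grid-stride kernel, and an array size of 1 << 20 elements. The helper name run_add, the fill values 1.0f and 2.0f, and the block size of 256 are illustrative choices.

```cuda
#include <cmath>
#include <cstdio>
#include <cuda_runtime.h>

// Kernel: each thread processes elements index, index + stride, ...
// (a grid-stride loop), so any launch configuration covers all n elements.
__global__ void add(int n, const float *x, float *y) {
    int index = blockIdx.x * blockDim.x + threadIdx.x;
    int stride = blockDim.x * gridDim.x;
    for (int i = index; i < n; i += stride) {
        y[i] = x[i] + y[i];
    }
}

// Hypothetical helper: adds two n-element arrays on the GPU and returns the
// maximum absolute error against the expected value 3.0f, or -1.0f if any
// CUDA call fails.
float run_add(int n) {
    float *x = nullptr;
    float *y = nullptr;

    // Unified memory is accessible from both the CPU and the GPU.
    if (cudaMallocManaged(&x, n * sizeof(float)) != cudaSuccess) {
        return -1.0f;
    }
    if (cudaMallocManaged(&y, n * sizeof(float)) != cudaSuccess) {
        cudaFree(x);
        return -1.0f;
    }

    // Initialize the inputs on the host (assumed values).
    for (int i = 0; i < n; i++) {
        x[i] = 1.0f;
        y[i] = 2.0f;
    }

    // Launch enough 256-thread blocks to cover all n elements.
    int blockSize = 256;
    int numBlocks = (n + blockSize - 1) / blockSize;
    add<<<numBlocks, blockSize>>>(n, x, y);

    // Check for launch errors, then wait for the GPU to finish
    // before reading the results on the host.
    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess) {
        err = cudaDeviceSynchronize();
    }

    float maxError = -1.0f;
    if (err == cudaSuccess) {
        maxError = 0.0f;
        for (int i = 0; i < n; i++) {
            maxError = fmaxf(maxError, fabsf(y[i] - 3.0f));
        }
    }

    cudaFree(x);
    cudaFree(y);
    return maxError;
}

int main() {
    int n = 1 << 20;  // about one million elements (assumed size)
    float maxError = run_add(n);
    printf("Max error: %f\n", maxError);
    return maxError == 0.0f ? 0 : 1;
}
```

Saved as add.cu, this sketch should build with the nvcc command above. On a machine with a CUDA-capable GPU, it should report a maximum error of zero, since 1.0f + 2.0f is exactly representable in single precision.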