GPU Add

A CUDA program that adds two 16x32 matrices supplied by the user. The kernel computes the element-wise sum on the GPU, and the host prints the result.
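
The description above can be sketched roughly as follows. This is a minimal illustration, not the project's actual source. It makes several assumptions that the description does not state: elements are `float`, the user supplies both matrices as whitespace-separated values on standard input (matrix A first, then B, in row-major order), and one 32x16 thread block covers the whole matrix. The names `matrixAdd`, `gpuAdd`, `readMatrix`, and `check` are hypothetical.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

#define ROWS 16
#define COLS 32

// Kernel: each thread adds one pair of elements.
__global__ void matrixAdd(const float *a, const float *b, float *c)
{
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    if (row < ROWS && col < COLS) {
        int idx = row * COLS + col;   // row-major index
        c[idx] = a[idx] + b[idx];
    }
}

// Abort with a message if a CUDA runtime call failed.
static void check(cudaError_t err, const char *what)
{
    if (err != cudaSuccess) {
        fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
        exit(EXIT_FAILURE);
    }
}

// Host wrapper (hypothetical name): copies inputs to the device,
// launches the kernel, and copies the sum back into hC.
void gpuAdd(const float *hA, const float *hB, float *hC)
{
    const size_t bytes = ROWS * COLS * sizeof(float);
    float *dA = NULL, *dB = NULL, *dC = NULL;

    check(cudaMalloc((void **)&dA, bytes), "cudaMalloc A");
    check(cudaMalloc((void **)&dB, bytes), "cudaMalloc B");
    check(cudaMalloc((void **)&dC, bytes), "cudaMalloc C");

    check(cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice), "copy A");
    check(cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice), "copy B");

    // Assumption: a single block of 32x16 = 512 threads, one per element.
    dim3 block(COLS, ROWS);
    matrixAdd<<<1, block>>>(dA, dB, dC);
    check(cudaGetLastError(), "kernel launch");

    // This blocking copy also waits for the kernel to finish.
    check(cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost), "copy C");

    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
}

// Assumption: input arrives on stdin as ROWS*COLS numbers per matrix.
static int readMatrix(float *m)
{
    for (int i = 0; i < ROWS * COLS; ++i)
        if (scanf("%f", &m[i]) != 1)
            return 0;
    return 1;
}

int main(void)
{
    static float a[ROWS * COLS], b[ROWS * COLS], c[ROWS * COLS];

    if (!readMatrix(a) || !readMatrix(b)) {
        fprintf(stderr, "expected two %dx%d matrices on stdin\n", ROWS, COLS);
        return EXIT_FAILURE;
    }

    gpuAdd(a, b, c);

    // The host prints the result produced by the kernel, one row per line.
    for (int r = 0; r < ROWS; ++r) {
        for (int col = 0; col < COLS; ++col)
            printf("%g ", c[r * COLS + col]);
        printf("\n");
    }
    return 0;
}
```

Assuming the file is saved as `gpu_add.cu`, it could be built with `nvcc gpu_add.cu -o gpu_add` and run with the two matrices redirected from a file. A single block is enough here because 512 threads fits within the 1024-threads-per-block limit of current GPUs; a larger matrix would need a multi-block grid.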