Hongqing-work/cudnn-learning-framework

What is the purpose of this code?

Opened this issue · 8 comments

int requestedAlgoCount = CUDNN_CONVOLUTION_FWD_ALGO_COUNT;
int returnedAlgoCount = -1;
cudnnConvolutionFwdAlgoPerf_t fwd_results[2 * requestedAlgoCount];
checkCUDNN(cudnnFindConvolutionForwardAlgorithm(*handle, src_tensor_desc,
filter_desc, conv_desc, dst_tensor_desc, requestedAlgoCount,
&returnedAlgoCount, fwd_results));
fwd_algo = fwd_results[0].algo;

You can refer to cudnnFindConvolutionForwardAlgorithm() in the cuDNN API documentation.
cuDNN implements several different algorithms for convolution. Different algorithms have different memory requirements and different performance. This function attempts all algorithms available for cudnnConvolutionForward(). It attempts both the mathType provided in convDesc and CUDNN_DEFAULT_MATH, which is why "fwd_results" is a 2 * requestedAlgoCount array.
The output "fwd_results" is a user-allocated array that stores performance metrics sorted ascending by compute time, so I choose the first (fastest) algorithm. If you want a smaller memory footprint, you can choose another entry.
"fwd_algo" is later passed to cudnnGetConvolutionForwardWorkspaceSize() to query how much temporary workspace the chosen algorithm needs, and the actual convolution call cudnnConvolutionForward() also needs it.
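To make the selection step concrete, here is a small Python sketch (illustrative only, not cuDNN API calls): the performance list comes back sorted by compute time, and you either take the first entry or the first one that fits a workspace budget. The algorithm names, timings, and sizes below are made up for the example.

```python
# Each tuple mimics a cudnnConvolutionFwdAlgoPerf_t entry:
# (algorithm name, compute time in ms, workspace memory in bytes).
# Values are hypothetical.
fwd_results = [
    ("IMPLICIT_PRECOMP_GEMM", 0.8, 64 * 1024 * 1024),
    ("WINOGRAD",              0.9, 16 * 1024 * 1024),
    ("IMPLICIT_GEMM",         1.5, 0),
]  # already sorted ascending by compute time, as cuDNN returns them

# Equivalent of `fwd_algo = fwd_results[0].algo`: fastest overall.
fastest = fwd_results[0][0]

# Alternative policy: fastest algorithm under a workspace budget.
def pick_under_budget(results, budget_bytes):
    for algo, _, mem in results:
        if mem <= budget_bytes:
            return algo
    raise RuntimeError("no algorithm fits the workspace budget")

frugal = pick_under_budget(fwd_results, 32 * 1024 * 1024)
```

With a 32 MiB budget the fastest entry is skipped and the second one is chosen, which is the trade-off described above.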

Why are there two algorithm choices in backward propagation, cudnnConvolutionBackwardFilter and cudnnConvolutionBackwardData? What are their roles?

In backward propagation, cudnnConvolutionBackwardFilter computes the gradient with respect to the filter, and cudnnConvolutionBackwardData computes the gradient with respect to the input. These two operations are different, so each needs its own algorithm.
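The two gradients can be sketched in NumPy for a tiny 1-D convolution (an analogue of what the cuDNN calls compute per channel, stride 1, no padding; this is not cuDNN code). The filter gradient is a correlation of the input with the upstream gradient, and the data gradient is a full convolution of the upstream gradient with the flipped filter; both are checked against finite differences.

```python
import numpy as np

# Tiny 1-D "valid" cross-correlation, the operation
# cudnnConvolutionForward performs (single channel, stride 1, no padding).
def conv1d(x, w):
    n = len(x) - len(w) + 1
    return np.array([np.dot(w, x[i:i + len(w)]) for i in range(n)])

rng = np.random.default_rng(0)
x = rng.standard_normal(5)   # input
w = rng.standard_normal(3)   # filter
g = rng.standard_normal(3)   # upstream gradient dL/dy

# Analogue of cudnnConvolutionBackwardFilter: dL/dw[k] = sum_i g[i] * x[i+k],
# a correlation of the input with the upstream gradient.
dw = np.array([np.dot(g, x[k:k + len(g)]) for k in range(len(w))])

# Analogue of cudnnConvolutionBackwardData: dL/dx[j] = sum_i g[i] * w[j-i],
# a "full" convolution of the upstream gradient with the flipped filter.
dx = np.convolve(g, w)  # np.convolve flips w; output has length 5

# Verify both against central finite differences.
eps = 1e-6
def loss(x_, w_):
    return float(np.dot(conv1d(x_, w_), g))

num_dw = np.array([(loss(x, w + eps * np.eye(3)[k]) -
                    loss(x, w - eps * np.eye(3)[k])) / (2 * eps)
                   for k in range(3)])
num_dx = np.array([(loss(x + eps * np.eye(5)[j], w) -
                    loss(x - eps * np.eye(5)[j], w)) / (2 * eps)
                   for j in range(5)])
```

Because the two gradients have different shapes and access patterns, it is natural that cuDNN lets you pick a separate algorithm for each.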

The maximum number of returned algorithms can be queried through the API cudnnGetConvolutionForwardAlgorithmMaxCount(). You can also find it in cudnn.h.

What is the process of cuDNN backpropagation? There is too little information on the Internet.

cuDNN doesn't provide an API for computing the gradient of the loss, so you have to write that code yourself. After that, you can follow the standard backpropagation algorithm and use the cuDNN backward APIs to compute the gradients layer by layer.
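As an example of the piece you have to write yourself, here is the gradient of a softmax + cross-entropy loss in NumPy (a common choice; the repo may use a different loss). Its output is the starting upstream gradient that would then be fed into the cuDNN backward-* calls of the last layer.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax.
    e = np.exp(z - z.max())
    return e / e.sum()

def dloss_dlogits(logits, label):
    # For L = cross_entropy(softmax(logits), label), the gradient
    # simplifies to softmax(logits) - one_hot(label).
    p = softmax(logits)
    p[label] -= 1.0
    return p

logits = np.array([2.0, 1.0, 0.1])
grad = dloss_dlogits(logits, label=0)
```

The components of `grad` sum to zero, and the entry for the true class is negative, pushing its logit up during gradient descent.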

What does "cudnnConvolutionBackwardData is used to get the differential of input" mean? Like, is it the derivative of p in y(p(x))?


Yes. After that, you can use the derivative with respect to p to get the derivative of the parameters in p(x), so the output of cudnnConvolutionBackwardData is passed to the backward function of the previous operation. You can refer to main.cu.
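The chaining described above can be sketched in a few lines of NumPy with two elementwise "layers" standing in for convolutions (shapes and values are illustrative, not from the repo): the gradient that the backward-data step of layer y produces is exactly the upstream gradient handed to layer p's backward pass.

```python
import numpy as np

x = np.array([0.5, -1.0, 2.0])
w1 = 3.0          # parameter of p(x) = w1 * x
w2 = -2.0         # parameter of y(h) = w2 * h

h = w1 * x        # forward through p
y = w2 * h        # forward through y
g_y = np.ones(3)  # upstream gradient dL/dy for L = sum(y)

# Backward through y: the "backward data" step gives dL/dh ...
g_h = w2 * g_y
# ... which is the upstream gradient for p's backward pass:
g_x = w1 * g_h                 # dL/dx ("backward data" of p)
g_w1 = float(np.dot(g_h, x))   # dL/dw1 ("backward filter" analogue of p)
```

In the convolutional case, g_h is what cudnnConvolutionBackwardData of layer y returns, and it feeds both backward calls of layer p.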