potential random issue with DTypes

Question

potential random issue with DTypes

Closed this issue 8 years ago · 8 comments

Currently random number generation only supporting float and double type using cuRAND. According to cuRAND doc(CUDA7.5, I haven't found the link to CUDA8.0), half type random number is not yet supported. A candidate solution is to create one extra float type tensor to generate values and convert them into DTypes other than float or double.

Currently the Random is used in dropout layer to created mask, which might be a issue if we want to support DType.
@tqchen

Answer 1 · 2016-06-23T02:09:13.000Z

What we can do is create a random number using real, and run a cast to cast the result

Answer 2 · 2016-06-23T02:22:42.000Z

I know, but cuRAND creates a chunk of random numbers given the pointer. So either we generate and cast them one by one to save space, or cast the entire chunk to save time. I'm not sure if the device api could be applied in this case.

Answer 3 · 2016-06-28T08:28:58.000Z

problem is bypassed.

Answer 4 · 2016-06-28T10:41:57.000Z

How did you bypass the problem?

Answer 5 · 2016-06-28T15:03:40.000Z

By using tcast in dropout layer. just like what the cast layer do. I checked pdf files in cuda8.0 for cuRAND, but I don't see any half random generation stuff. In future, a Dtype random generation might be needed.

Answer 6 · 2016-06-28T15:15:00.000Z

The only place right now where that solution is unsatisfactory is for Float64 where we limit the amount of randomness to 32bit

Answer 7 · 2016-06-28T16:36:33.000Z

Let me check my dropout code. I think we can solve it like cudnn batch norm with some switch case stuff. Also it seems that cudnn dropout is not implemented in mxnet.

Answer 8 · 2016-06-29T01:55:54.000Z

I just checked my dropout. I didn't use 'swith case' suff in mask generation. The default real_t is already good enough.