CUDA -> CPU issue
kamwoh opened this issue · 3 comments
kamwoh commented
def drop_connect(inputs, p, training):
""" Drop connect. """
if not training: return inputs
batch_size = inputs.shape[0]
keep_prob = 1 - p
random_tensor = keep_prob
random_tensor += torch.rand([batch_size, 1, 1, 1], dtype=inputs.dtype) # uniform [0,1)
binary_tensor = torch.floor(random_tensor)
output = inputs / keep_prob * binary_tensor # error happens here
return output
Faced error: RuntimeError: expected backend CUDA and dtype Float but got backend CPU and dtype Float
When I run on GPU, this error occurs and the traceback points to the line marked above. The problem is that torch.rand creates random_tensor on the CPU while inputs lives on the GPU, so the multiply mixes devices. I think we should move binary_tensor to inputs.device:
binary_tensor = torch.floor(random_tensor).to(inputs.device)
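A slightly cheaper variant of the same fix is to create the random tensor on the input's device from the start, so no cross-device transfer is needed at all. A minimal sketch (assuming the same drop-connect logic as the snippet above):

```python
import torch

def drop_connect(inputs, p, training):
    """Drop connect: randomly zero whole examples and rescale survivors."""
    if not training:
        return inputs
    batch_size = inputs.shape[0]
    keep_prob = 1 - p
    # Allocate the random tensor on the same device (and dtype) as inputs,
    # so the multiply below never mixes CPU and CUDA tensors.
    random_tensor = keep_prob + torch.rand(
        [batch_size, 1, 1, 1], dtype=inputs.dtype, device=inputs.device
    )
    binary_tensor = torch.floor(random_tensor)  # 1 with prob keep_prob, else 0
    return inputs / keep_prob * binary_tensor
```

With `device=inputs.device`, the function works unchanged whether the model is on CPU or GPU.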
dami23 commented
Hi kamwoh, are you using EfficientNet to train your own model, or are you using the pretrained models? If you use the pretrained models, the newly released version requires calling model.eval() after loading the model.
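To illustrate why model.eval() matters here: drop_connect keys off the module's training flag, so inference without eval() keeps the stochastic drops active. A minimal sketch with a hypothetical TinyBlock module (not the actual EfficientNet code):

```python
import torch
import torch.nn as nn

class TinyBlock(nn.Module):
    """Toy module whose forward applies drop connect only in training mode."""

    def __init__(self, p=0.2):
        super().__init__()
        self.p = p

    def forward(self, x):
        if not self.training:
            return x  # eval mode: deterministic identity
        keep_prob = 1 - self.p
        # Per-example binary mask on the same device as x
        mask = torch.floor(
            keep_prob
            + torch.rand([x.shape[0], 1, 1, 1], dtype=x.dtype, device=x.device)
        )
        return x / keep_prob * mask

model = TinyBlock()
x = torch.ones(8, 3, 4, 4)
model.eval()  # flips model.training to False, disabling the stochastic drop
y = model(x)
```

After model.eval(), repeated forward passes give identical outputs; without it, each pass drops a different random subset of examples.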