got a RuntimeError, need help plz

Question

got a RuntimeError, need help plz

kellen5l opened this issue 3 years ago · 11 comments

Hi, i transferred the code to OpenPCDet v0.52.0, but got a RuntimeError. could u help me plz.
Error:

Traceback (most recent call last):                                                                                                                                                | 0/1856 [00:00<?, ?it/s]
  File "train.py", line 202, in <module>
    main()
  File "train.py", line 171, in main
    merge_all_iters_to_one_epoch=args.merge_all_iters_to_one_epoch
  File "/home/featurize/OpenPCDet/tools/train_utils/train_utils.py", line 118, in train_model
    dataloader_iter=dataloader_iter
  File "/home/featurize/OpenPCDet/tools/train_utils/train_utils.py", line 52, in train_one_epoch
    loss.backward()
  File "/environment/miniconda3/lib/python3.7/site-packages/torch/_tensor.py", line 307, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs)
  File "/environment/miniconda3/lib/python3.7/site-packages/torch/autograd/__init__.py", line 156, in backward
    allow_unreachable=True, accumulate_grad=True)  # allow_unreachable flag
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [4611, 64]], which is output 0 of ReluBackward0, is at version 1; expected version 0 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

my environment :

ubuntu 20.04
cuda 11.3
python 3.7.10
torch 1.10.0+cu113
spconv-cu113 2.1.21

Answer 1 · 2022-04-29T08:10:34.000Z

Hey, how did you solve the problem? Can you please share?

Answer 2 · 2022-06-07T15:11:06.000Z

same question

Answer 3 · 2022-06-07T15:11:25.000Z

收到谢谢

Answer 4 · 2022-09-15T01:12:03.000Z

@kellen5l @Raiden-cn @VsionQing @PointsCoder
Hi, how did you solve the problem?

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [4611, 64]], which is output 0 of ReluBackward0, is at version 1; expected version 0 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

my environment :

ubuntu 20.04
cuda 11.3
python 3.7.10
torch 1.10.0+cu113
spconv-cu113

Answer 5 · 2022-09-15T01:12:23.000Z

收到谢谢

Answer 6 · 2022-12-05T06:50:42.000Z

Same issue here too. Anyone who solved this problem?

Answer 7 · 2023-03-28T15:42:21.000Z

have encountered the same problem. Has anyone solved it？

Answer 8 · 2023-03-28T15:42:45.000Z

收到谢谢

Answer 9 · 2023-04-20T09:34:38.000Z

me too,please,thank you

Answer 10 · 2023-08-08T02:55:48.000Z

@kellen5l @Raiden-cn @VsionQing @PointsCoder Hi, how did you solve the problem?

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [4611, 64]], which is output 0 of ReluBackward0, is at version 1; expected version 0 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

my environment :

ubuntu 20.04 cuda 11.3 python 3.7.10 torch 1.10.0+cu113 spconv-cu113

Hi,have you solved this problem?

Answer 11 · 2023-11-30T02:15:55.000Z

anybody solve this problem?
thansks