limhoyeon/ToothGroupNetwork

when I was training I got the following error

caogj-0521 opened this issue · 3 comments

Hello:

I'm using the main branch an follow the procedure:

  1. use challenger dataset to process,
    2.use processing data to train
    3.train command:
    python start_train.py --model_name tgnet_fps --config_path train_configs/tgnet_fps.py --experiment_name tgnet --input_data_dir_path processDir --train_data_split_txt_path split/base_name_train_fold.txt --val_data_split_txt_path split/base_name_val_fold.txt
    However, the following error occure
    image

When I comment out the code ‘raise’:
the following error occurre
39d08ca81aeab5de92173beaf461f0a

My running environment:
The graphics card is RTX 3090
Python 3.7.16
torch: 1.13.1+cu117, cuda 11.7,cudnn 8500
Ubuntu 20.4

Now I don't know where the problem is, Can you help me?

maybe this issue is related to #4.
there are issues with CUDA functions not executing properly in some PC environments. I plan to remove the CUDA functions and replace them with Python code. Please bear with me a little longer, or alternatively, you can try reinstalling the 'pointops' library from https://github.com/POSTECH-CVLab/point-transformer/tree/master/lib/pointops.

Hello:

I'm using the main branch an follow the procedure:

  1. use challenger dataset to process,
    2.use processing data to train
    3.train command:
    python start_train.py --model_name tgnet_fps --config_path train_configs/tgnet_fps.py --experiment_name tgnet --input_data_dir_path processDir --train_data_split_txt_path split/base_name_train_fold.txt --val_data_split_txt_path split/base_name_val_fold.txt
    However, the following error occure
    image

When I comment out the code ‘raise’: the following error occurre 39d08ca81aeab5de92173beaf461f0a

My running environment: The graphics card is RTX 3090 Python 3.7.16 torch: 1.13.1+cu117, cuda 11.7,cudnn 8500 Ubuntu 20.4

Now I don't know where the problem is, Can you help me?

Hello, have you solved this problem? I encountered the same problem as you.
My running environment:
The graphics card is RTX 4090
Python 3.6.13
pytorch1.10 cuda11.7 cudnn
Ubuntu 20.4
Hope your reply. :)