epic-kitchens/epic-kitchens-slowfast

Graphics card memory overflow during the test

em0il opened this issue · 2 comments

em0il commented

Hardware:
System 1: Nvidia GTX 970 (4GB memory)
System 2: Nvidia RTX 2080 (11GB memory)
(these are two separate systems).

I tried running the tests with the following command:

python tools/run_net.py --cfg configs/EPIC-KITCHENS/SLOWFAST_8x8_R50.yaml NUM_GPUS 1 OUTPUT_DIR /path/to/experiment_dir EPICKITCHENS.VISUAL_DATA_DIR /path/to/dataset EPICKITCHENS.ANNOTATIONS_DIR /path/to/annotations TRAIN.ENABLE False TEST.ENABLE True TEST.CHECKPOINT_FILE_PATH /path/to/experiment_dir/checkpoints/checkpoint_best.pyth EPICKITCHENS.TEST_LIST EPIC_100_test_timestamps.pkl EPICKITCHENS.TEST_SPLIT test

(I have adapted the paths to my systems in each case. The Annotation repository, SlowFast repository and SlowFast model were all downloaded on 2021-05-18.)

When trying to run the test on system 1 it aborted with "Killed".
Thereupon I shortened the EPIC_100_test_timestamps.pkl to one annotation.
When running the test with the shortened pickle file I got this error (see picture).
Screenshot from 2021-05-18 09-35-51
I thought I needed more graphics card memory so I switched to the System 2.
When running the test with the original data, it was also terminated with "Killed".
When running the test with the shortened Pickel file, I received the same error as on system 1 (see picture).
Bildschirmfoto vom 2021-05-18 13-07-50

What is the reason for this?
Is there a problem loading the data?

Hi,

Thanks for using our code. Can you try setting TEST.BATCH_SIZE 1 and let me know if it solves your issue?

em0il commented

Thank you, the test can now be run on system 2. system 1 seems to be too weak for this task.
Thank you for your quick reply.
I wish you good luck for the challenge.