Graphics card memory overflow during the test
em0il opened this issue · 2 comments
Hardware:
System 1: Nvidia GTX 970 (4GB memory)
System 2: Nvidia RTX 2080 (11GB memory)
(these are two separate systems).
I tried running the tests with the following command:
python tools/run_net.py --cfg configs/EPIC-KITCHENS/SLOWFAST_8x8_R50.yaml NUM_GPUS 1 OUTPUT_DIR /path/to/experiment_dir EPICKITCHENS.VISUAL_DATA_DIR /path/to/dataset EPICKITCHENS.ANNOTATIONS_DIR /path/to/annotations TRAIN.ENABLE False TEST.ENABLE True TEST.CHECKPOINT_FILE_PATH /path/to/experiment_dir/checkpoints/checkpoint_best.pyth EPICKITCHENS.TEST_LIST EPIC_100_test_timestamps.pkl EPICKITCHENS.TEST_SPLIT test
(I have adapted the paths to my systems in each case. The Annotation repository, SlowFast repository and SlowFast model were all downloaded on 2021-05-18.)
When trying to run the test on system 1 it aborted with "Killed".
Thereupon I shortened the EPIC_100_test_timestamps.pkl to one annotation.
When running the test with the shortened pickle file I got this error (see picture).
I thought I needed more graphics card memory so I switched to the System 2.
When running the test with the original data, it was also terminated with "Killed".
When running the test with the shortened Pickel file, I received the same error as on system 1 (see picture).
What is the reason for this?
Is there a problem loading the data?
Hi,
Thanks for using our code. Can you try setting TEST.BATCH_SIZE 1
and let me know if it solves your issue?
Thank you, the test can now be run on system 2. system 1 seems to be too weak for this task.
Thank you for your quick reply.
I wish you good luck for the challenge.