Inference error

Question

Opened this issue 5 months ago · 3 comments

I ran the command below for inference and got the following error. Does anyone know how to fix it?

command: python launch.py --n_GPUs 1 main.py --batch_size 8 --precision single
error:
[W socket.cpp:401] [c10d] The server socket has failed to bind to [::]:8023 (errno: 98 - Address already in use).
[W socket.cpp:401] [c10d] The server socket has failed to bind to workstation2:8023 (errno: 98 - Address already in use).
[E socket.cpp:435] [c10d] The server socket has failed to listen on any local network address.

Answer 1 · 2024-04-01T16:44:10.000Z

Are you launching several jobs from a single machine? The error shows that port 8023 is already in use, so each job needs its own master port.
https://github.com/SeungjunNah/DeepDeblur-PyTorch/blob/master/src/option.py#L33
Please provide more details.
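
To act on the advice about master ports, it may help to first check whether port 8023 is really taken and to pick a free one. The sketch below uses only the Python standard library. The helper names `port_is_free` and `find_free_port` are hypothetical and are not part of DeepDeblur-PyTorch. How the chosen port is passed to the launcher depends on the option defined at the linked line of option.py, which is not shown here.

```python
import socket


def port_is_free(port: int, host: str = "") -> bool:
    """Return True if a TCP socket can currently bind to (host, port)."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        try:
            s.bind((host, port))
        except OSError:
            # errno 98 (Address already in use) on Linux ends up here
            return False
    return True


def find_free_port() -> int:
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        s.bind(("", 0))
        return s.getsockname()[1]


if __name__ == "__main__":
    # 8023 is the port reported in the error message above
    if port_is_free(8023):
        print("port 8023 is free")
    else:
        print("port 8023 is busy; a free alternative is", find_free_port())
```

A port returned by `find_free_port` is only free at the moment of the check, so another process can still claim it before the job starts. For a handful of jobs on one workstation, assigning fixed, distinct ports per job is usually simpler.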

Answer 2 · 2024-04-02T09:15:48.000Z

Answer 3 · 2024-04-03T04:22:26.000Z

# save all of the evaluation results
python main.py --n_GPUs 1 --batch_size 8 --dataset GOPRO_Large --save_results all

Please refer to args.end_epoch and see how it is used in main.py.