megvii-research/NAFNet

train.py: error: unrecognized arguments: --local-rank=0

I get this error when trying to train on the GoPro dataset with:
python -m torch.distributed.launch --nproc_per_node=1 --master_port=4321 train.py -opt options/train/GoPro/NAFNet-width32.yml --launcher pytorch

I searched train.py and it does not contain --local-rank anywhere.

How can I fix this?

Add this to train.py:
[image]

Change

parser.add_argument('--local_rank', type=int, default=0)

To

parser.add_argument('--local-rank', type=int, default=0)

And I didn't add

os.environ['RANK'] = str(0)
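
If the same script also has to run with older launchers that still pass the underscore spelling, one option is to register both spellings on a single argument. This is only a minimal sketch, not the repository's actual parser: `build_parser` is a hypothetical helper, and it assumes the error comes from newer PyTorch versions of torch.distributed.launch passing --local-rank instead of --local_rank.

```python
import argparse


def build_parser():
    """Hypothetical helper: a parser that accepts both spellings of the flag."""
    parser = argparse.ArgumentParser()
    # Both option strings store into the same attribute, args.local_rank.
    parser.add_argument(
        '--local_rank', '--local-rank',
        dest='local_rank', type=int, default=0)
    return parser


if __name__ == '__main__':
    parser = build_parser()
    # Hyphenated spelling, as in the error message above.
    print(parser.parse_args(['--local-rank=0']).local_rank)
    # Underscore spelling, as in the original add_argument call.
    print(parser.parse_args(['--local_rank=0']).local_rank)
```

With this, the rest of the script can keep reading args.local_rank whichever spelling the launcher uses.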

Thanks. When I tried to use torchrun, it reported "can not open python: no such file". After I followed your change, it works!
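
For anyone who wants torchrun to work as well: torchrun does not pass a local-rank argument at all, it sets the LOCAL_RANK environment variable instead. A rough sketch of reading the rank from either source follows. `resolve_local_rank` is a hypothetical helper, not something in this repository, and the rest of the training script may need other changes that are not shown here.

```python
import argparse
import os


def resolve_local_rank(argv=None, env=None):
    """Hypothetical helper: take the local rank from the CLI if given,
    otherwise from the LOCAL_RANK environment variable, otherwise 0."""
    env = os.environ if env is None else env
    parser = argparse.ArgumentParser()
    parser.add_argument(
        '--local_rank', '--local-rank',
        dest='local_rank', type=int, default=None)
    # parse_known_args ignores the script's other options (-opt, --launcher, ...).
    args, _unknown = parser.parse_known_args(argv)
    if args.local_rank is not None:
        return args.local_rank
    return int(env.get('LOCAL_RANK', 0))


if __name__ == '__main__':
    print(resolve_local_rank())
```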