train.py: error: unrecognized arguments: --local-rank=0
Opened this issue · 3 comments
davidvct commented
Encounter this error when trying to train GoPro datasets:
python -m torch.distributed.launch --nproc_per_node=1 --master_port=4321 train.py -opt options/train/GoPro/NAFNet-width32.yml --launcher pytorch
I searched the train.py, there is no --local-rank=0
.
How to fix?
txy00001 commented
sentinel8b commented
Change
parser.add_argument('--local_rank', type=int, default=0)
To
parser.add_argument('--local-rank', type=int, default=0)
And I didn't add
os.environ['RANK'] = str(0)
FogSue commented
Change
parser.add_argument('--local_rank', type=int, default=0)To
parser.add_argument('--local-rank', type=int, default=0)And I didn't add
os.environ['RANK'] = str(0)
thanks,when i try to use torchrun it reported:”can not open python:no such file“,when i follow your change,it works!