deepmodeling/dpdispatcher

paramiko.ssh_exception.AuthenticationException: Authentication failed.

caojiachun opened this issue · 6 comments

When I updated dpgen from 10.0 to 10.6, I encountered an ssh error

dpgen: 10.0 to 10.6

ssh erro for dpgen10.0

image

image
I solved the problem by changing the look_for_keys in client.py for 10.0 version, however, It doesnt work for 10.6 version.

ssh erro for dpgen10.6

Traceback (most recent call last):
File "/data/jccao/app/deepmd2_1_5/bin/dpgen", line 8, in
sys.exit(main())
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpgen/main.py", line 185, in main
args.func(args)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpgen/generator/run.py", line 3642, in gen_run
run_iter (args.PARAM, args.MACHINE)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpgen/generator/run.py", line 3607, in run_iter
run_train (ii, jdata, mdata)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpgen/generator/run.py", line 598, in run_train
submission = make_submission(
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpgen/dispatcher/Dispatcher.py", line 359, in make_submission
machine = Machine.load_from_dict(abs_mdata_machine)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpdispatcher/machine.py", line 129, in load_from_dict
context = BaseContext.load_from_dict(machine_dict)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpdispatcher/base_context.py", line 35, in load_from_dict
context = context_class.load_from_dict(context_dict)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpdispatcher/ssh_context.py", line 350, in load_from_dict
ssh_context = cls(
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpdispatcher/ssh_context.py", line 323, in init
self.ssh_session = SSHSession(**remote_profile)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpdispatcher/ssh_context.py", line 44, in init
self._setup_ssh()
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpdispatcher/utils.py", line 162, in wrapper
return func(*args, **kwargs)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/dpdispatcher/ssh_context.py", line 166, in _setup_ssh
ts.auth_password(self.username, self.password)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/paramiko/transport.py", line 1564, in auth_password
return self.auth_handler.wait_for_response(my_event)
File "/data/jccao/app/deepmd2_1_5/lib/python3.10/site-packages/paramiko/auth_handler.py", line 245, in wait_for_response
raise e
paramiko.ssh_exception.AuthenticationException: Authentication failed.

Details

I run my "train" work in remote cluster and "fp" works in another cluster. The erro only exist in "train" work.
The version of deepkit is 2.1.5.

image
image

You can see deepmodeling/dpgen#722.
sorry, the erro in deepmodeling/dpgen#722 is different from mine.

You can see deepmodeling/dpgen#722.
sorry, the erro in deepmodeling/dpgen#722 is different from mine.

Actually I want to say Do you have to use SSHContext? Maybe you can use LocalContext to transfer files.

You can see deepmodeling/dpgen#722.
sorry, the erro in deepmodeling/dpgen#722 is different from mine.

Actually I want to say Do you have to use SSHContext? Maybe you can use LocalContext to transfer files.

Ok, thank you for your answer.

njzjz commented

We may add a parameter look_for_keys just like paramiko.