Hello,
I would like to report an error that started happening when using a new remote machine (alps-daint)
Some context information:
- I have to renew the Alps ssh key every 24h
- I am able to ssh into Alps in all times
When running an aiida workflow that takes longer than 24h, I loose connection to Alps after the first 24h, even if I renew the key. When trying to connect to Alps to check the output of calculation (unfinished) via aiida, I get:
(aiida) [local]$ verdi calcjob gotocomputer <pk>
Report: going to the remote work directory...
X11 forwarding request failed on channel 0
name@daint.alps.cscs.ch: Permission denied (publickey).
and then loose the results of the workchain. I can still ssh in Alps without aiida though. Does anyone know what is happening and how I can solve it?
Thank you
Hi Virginie, normally even if AiiDA cannot reconnect, you don’t lose the results oft the calculations. AiiDA will pause the work chain, and as soon as you gain connection again, you can ‘replay’ it with verdi process play
and similar commands, and AiiDA will continue from where it stopped.
More specifically about your question, it seems that AiiDA and your default SSH in the shell are using different configurations/keys. It would be useful to know the following:
- the output of
verdi computer show
for the relevant computer (in particular to check if you are specifying the path of the SSH key, and if this is the path that is getting replaced.
- the relevant part of your
~/.ssh/config
file, if any section on Alps is there
- does
verdi computer test
work when you refresh the key (while gotocomputer
doesn’t) or both don’t work?
- When you refresh the key, do you put a passphrase or not? (it’s easier without so there are no issues with the SSH agent etc. - anyways the keys are short lived so it shouldn’t be a major problem)
Hi Giovanni,
Thank you for your replies. My answers to your comments:
verdi computer show
does not show any specific path to the SSH key. In such case, what default path is chosen?
- the
~/.ssh/conf
file contains:
Host ela
Hostname ela.cscs.ch
User vdemestr
IdentityFile ~/.ssh/cscs-key
Host alps
Hostname daint.alps.cscs.ch
User vdemestr
Proxyjump ela
Forwardagent yes
IdentityFile ~/.ssh/cscs-key
AddKeysToAgent yes
where ela is a proxy.
To add some more details:
For privacy reasons, I think it would be better to continue this conversation elsewhere 