Disconnection from remote machine, permission denied

Hello,

I would like to report an error that started happening when I began using a new remote machine (alps-daint).

Some context information:

  • I have to renew the Alps ssh key every 24h
  • I am able to ssh into Alps at all times

When running an aiida workflow that takes longer than 24h, I lose the connection to Alps after the first 24h, even if I renew the key. When trying to connect to Alps via aiida to check the output of the (unfinished) calculation, I get:

(aiida) [local]$ verdi calcjob gotocomputer <pk>
Report: going to the remote work directory...
X11 forwarding request failed on channel 0
name@daint.alps.cscs.ch: Permission denied (publickey).

and then lose the results of the workchain. I can still ssh into Alps without aiida though. Does anyone know what is happening and how I can solve it?

Thank you

Hi Virginie, normally even if AiiDA cannot reconnect, you don’t lose the results of the calculations. AiiDA will pause the work chain, and as soon as you regain the connection, you can ‘replay’ it with verdi process play and similar commands, and AiiDA will continue from where it stopped.

More specifically about your question, it seems that AiiDA and your default SSH in the shell are using different configurations/keys. It would be useful to know the following:

  • the output of verdi computer show for the relevant computer (in particular to check if you are specifying the path of the SSH key, and if this is the path that is getting replaced)
  • the relevant part of your ~/.ssh/config file, if any section on Alps is there
  • does verdi computer test work when you refresh the key (while gotocomputer doesn’t) or both don’t work?
  • When you refresh the key, do you put a passphrase or not? (it’s easier without so there are no issues with the SSH agent etc. - anyways the keys are short lived so it shouldn’t be a major problem)
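
To compare the two configurations, it can help to see which settings the plain ssh client actually resolves for the host. The sketch below is only an illustration: it relies on `ssh -G <host>`, which prints the effective OpenSSH client configuration without connecting, and the function names `parse_ssh_g` and `effective_ssh_options` are hypothetical helpers, not part of AiiDA. It shows what the shell's ssh would use; whatever AiiDA has stored for the computer may differ and still has to be checked separately.

```python
import subprocess


def parse_ssh_g(output):
    """Parse `ssh -G <host>` output: one 'keyword value' pair per line.

    Returns a dict mapping each lowercased keyword to the list of its
    values, since keywords such as identityfile can appear several times.
    """
    options = {}
    for line in output.splitlines():
        line = line.strip()
        if not line:
            continue
        keyword, _, value = line.partition(" ")
        options.setdefault(keyword.lower(), []).append(value)
    return options


def effective_ssh_options(host):
    """Ask the local ssh client which options it would use for `host`.

    Hypothetical helper: requires an OpenSSH client that supports `-G`.
    """
    result = subprocess.run(
        ["ssh", "-G", host], capture_output=True, text=True, check=True
    )
    return parse_ssh_g(result.stdout)


# Example usage (assumes a host alias, e.g. "alps", exists in ~/.ssh/config):
#     opts = effective_ssh_options("alps")
#     for keyword in ("hostname", "user", "identityfile", "proxyjump"):
#         print(keyword, opts.get(keyword))
```

If the identity file or proxy reported here is not what AiiDA is configured to use, that would be consistent with ssh working from the shell while AiiDA gets "Permission denied (publickey)".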

Hi Giovanni,

Thank you for your replies. My answers to your comments:

  • verdi computer show does not show any specific path to the SSH key. In that case, which default path is chosen?
  • the ~/.ssh/config file contains:
Host ela
  Hostname ela.cscs.ch
  User vdemestr
  IdentityFile ~/.ssh/cscs-key

Host alps
  Hostname daint.alps.cscs.ch
  User vdemestr
  Proxyjump ela
  Forwardagent yes
  IdentityFile ~/.ssh/cscs-key
  AddKeysToAgent yes

where ela is a proxy.

  • verdi computer test fails the first test

  • yes, I add a password

To add some more details:

For privacy reasons, I think it would be better to continue this conversation elsewhere :slight_smile: