Tools/Manuals/TS60

From EGIWiki
Jump to: navigation, search
Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Documentation menu: Home Manuals Procedures Training Other Contact For: VO managers Administrators

Contents



Back to Troubleshooting Guide


ssh problem from WN to CE

Full message

Various error messages, usually not directly showing an ssh/scp problem; see other job submission errors.

Before a job starts, the batch system needs to copy the job wrapper script and the user proxy to the WN. It is normal for Torque/PBS and possibly other batch systems to rely on scp for letting the WN copy those files from the CE host before the real job gets started.

Similarly, after the job has finished, the stdout and stderr of the job wrapper need to be copied to the CE.

The various scp invocations may fail for several reasons. If the failures are intermittent, then the SSH daemon on the CE may not have been configured to allow sufficient simultaneous connections.

Diagnosis

Solution

Possible problem with duplicate entries for the WNs in the CE ssh configuration.

/usr/sbin/edg-pbs-knownhosts
/usr/sbin/edg-pbs-shostsequiv
/usr/sbin/edg-pbs-knownhosts

If insufficient connections are allowed to the SSH daemon on the CE:

MaxStartups 100
Personal tools
Namespaces
Variants
Actions
Navigation
Toolbox
Print/export