Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Tools/Manuals/TS68"

From EGIWiki
Jump to navigation Jump to search
(Created page with '{{TOC_right}} Category:FAQ ------ Back to Troubleshooting Guide ------ = Globus error 22: the job manager failed to create an internal…')
 
m
Line 21: Line 21:
* It does not exist on the CE.
* It does not exist on the CE.
* Permissions are wrong.
* Permissions are wrong.
* Its subdirectory <font face="Courier New,Courier">.globus/job/CE-hostname</font> has reached '''32000''' subdirectories
* Its subdirectory <font face="Courier New,Courier">.globus/job/CE-hostname</font> has reached '''32000''' subdirectories for jobs due to a [https://savannah.cern.ch/bugs/index.php?53358 cleanup bug].  A fix for that is an '''extra cleanup cron job''' provided by lcg-CE version 3.1.35-0 and later.
for jobs due to a [https://savannah.cern.ch/bugs/index.php?53358 cleanup bug].  A fix for
that is an '''extra cleanup cron job''' provided by lcg-CE version 3.1.35-0 and later.
* Some subdirectory contains too many subdirectories.  If this is the case even with the
* Some subdirectory contains too many subdirectories.  If this is the case even with the
latest lcg-CE version, check <font face="Courier New,Courier">/var/log/cleanup-grid-accounts.log</font> to see if and how
latest lcg-CE version, check <font face="Courier New,Courier">/var/log/cleanup-grid-accounts.log</font> to see if and how

Revision as of 09:22, 26 May 2011


Back to Troubleshooting Guide


Globus error 22: the job manager failed to create an internal script argument file

Full message

As seen in Condor logs as used in ATLAS pilot submission or in the output of glite-wms-job-logging-info -v 2 for a gLite WMS job:

018 (9163559.000.000) 03/05 11:25:27 Globus job submission failed!
    Reason: 22 the job manager failed to create an internal script argument file

Diagnosis

This can be due to problems with the home directory for the user:

  • It does not exist on the CE.
  • Permissions are wrong.
  • Its subdirectory .globus/job/CE-hostname has reached 32000 subdirectories for jobs due to a cleanup bug. A fix for that is an extra cleanup cron job provided by lcg-CE version 3.1.35-0 and later.
  • Some subdirectory contains too many subdirectories. If this is the case even with the

latest lcg-CE version, check /var/log/cleanup-grid-accounts.log to see if and how /opt/lcg/sbin/cleanup-grid-accounts.sh is being run regularly to clean out the home directories of grid users. By default it will only clean out pool account directories, so make sure that extra accounts that need cleaning are listed in /opt/lcg/etc/cleanup-grid-accounts.conf like this:

EXTRA='account1 account2 account3'

Note that YAIM automatically lists all the relevant accounts from users.conf.