Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Tools/Manuals/TS68"

From EGIWiki
Jump to navigation Jump to search
(Created page with '{{TOC_right}} Category:FAQ ------ Back to Troubleshooting Guide ------ = Globus error 22: the job manager failed to create an internal…')
 
 
(2 intermediate revisions by one other user not shown)
Line 1: Line 1:
{{Template:Op menubar}}
{{Template:Doc_menubar}}
[[Category:Operations Manuals]]
{{TOC_right}}
{{TOC_right}}
[[Category:FAQ]]
------
------
Back to [[Tools/Manuals/SiteProblemsFollowUp|Troubleshooting Guide]]
Back to [[Tools/Manuals/SiteProblemsFollowUp|Troubleshooting Guide]]
Line 21: Line 23:
* It does not exist on the CE.
* It does not exist on the CE.
* Permissions are wrong.
* Permissions are wrong.
* Its subdirectory <font face="Courier New,Courier">.globus/job/CE-hostname</font> has reached '''32000''' subdirectories
* Its subdirectory <font face="Courier New,Courier">.globus/job/CE-hostname</font> has reached '''32000''' subdirectories for jobs due to a [https://savannah.cern.ch/bugs/index.php?53358 cleanup bug].  A fix for that is an '''extra cleanup cron job''' provided by lcg-CE version 3.1.35-0 and later.
for jobs due to a [https://savannah.cern.ch/bugs/index.php?53358 cleanup bug].  A fix for
* Some subdirectory contains too many subdirectories.  If this is the case even with the latest lcg-CE version, check <font face="Courier New,Courier">/var/log/cleanup-grid-accounts.log</font> to see if and how <font face="Courier New,Courier">/opt/lcg/sbin/cleanup-grid-accounts.sh</font> is being run regularly to clean out the home directories of grid users.  By default it will only clean out pool account directories, so make sure that extra accounts that need cleaning are listed in <font face="Courier New,Courier">/opt/lcg/etc/cleanup-grid-accounts.conf</font> like this:
that is an '''extra cleanup cron job''' provided by lcg-CE version 3.1.35-0 and later.
* Some subdirectory contains too many subdirectories.  If this is the case even with the
latest lcg-CE version, check <font face="Courier New,Courier">/var/log/cleanup-grid-accounts.log</font> to see if and how
<font face="Courier New,Courier">/opt/lcg/sbin/cleanup-grid-accounts.sh</font> is being run regularly to clean out the home
directories of grid users.  By default it will only clean out pool account directories,
so make sure that extra accounts that need cleaning are listed in
<font face="Courier New,Courier">/opt/lcg/etc/cleanup-grid-accounts.conf</font> like this:


  EXTRA='account1 account2 account3'
  EXTRA='account1 account2 account3'


Note that YAIM automatically lists all the relevant accounts from <font face="Courier New,Courier">users.conf</font>.
Note that YAIM automatically lists all the relevant accounts from <font face="Courier New,Courier">users.conf</font>.

Latest revision as of 13:46, 23 November 2012

Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Documentation menu: Home Manuals Procedures Training Other Contact For: VO managers Administrators



Back to Troubleshooting Guide


Globus error 22: the job manager failed to create an internal script argument file

Full message

As seen in Condor logs as used in ATLAS pilot submission or in the output of glite-wms-job-logging-info -v 2 for a gLite WMS job:

018 (9163559.000.000) 03/05 11:25:27 Globus job submission failed!
    Reason: 22 the job manager failed to create an internal script argument file

Diagnosis

This can be due to problems with the home directory for the user:

  • It does not exist on the CE.
  • Permissions are wrong.
  • Its subdirectory .globus/job/CE-hostname has reached 32000 subdirectories for jobs due to a cleanup bug. A fix for that is an extra cleanup cron job provided by lcg-CE version 3.1.35-0 and later.
  • Some subdirectory contains too many subdirectories. If this is the case even with the latest lcg-CE version, check /var/log/cleanup-grid-accounts.log to see if and how /opt/lcg/sbin/cleanup-grid-accounts.sh is being run regularly to clean out the home directories of grid users. By default it will only clean out pool account directories, so make sure that extra accounts that need cleaning are listed in /opt/lcg/etc/cleanup-grid-accounts.conf like this:
EXTRA='account1 account2 account3'

Note that YAIM automatically lists all the relevant accounts from users.conf.