Difference between revisions of "Tools/Manuals/SiteProblemsFollowUp"
< Tools
Jump to navigation
Jump to search
Line 30: | Line 30: | ||
== Workload Management == | == Workload Management == | ||
A job submitted via WMS/Condor-G/RB to an LCG-CE failed. | A job submitted via WMS/Condor-G/RB to an LCG-CE failed. | ||
# [[Tools/Manuals/TS50|10 data transfer to the server failed]] | |||
# [[Tools/Manuals/TS51|Cannot read JobWrapper output...]] | |||
# [[Tools/Manuals/TS52|Cannot download .BrokerInfo]] | |||
# [[Tools/Manuals/TS53|BrokerHelper: no compatible resources]] | |||
# [[Tools/Manuals/TS54|request expired]] | |||
# [[Tools/Manuals/TS55|Jobs sent to my WMS stay in Waiting state forever]] | |||
# [[Tools/Manuals/TS56|Jobs sent to some CE stay in Ready state forever]] | |||
# [[Tools/Manuals/TS57|Jobs sent to some CE stay in Scheduled state forever]] | |||
# [[Tools/Manuals/TS58|Jobs sent to some CE stay in Running state forever]] | |||
# [[Tools/Manuals/TS59|444444 waiting jobs]] | |||
# [[Tools/Manuals/TS60|ssh problem from WN to CE]] | |||
# [[Tools/Manuals/TS61|Proxy expired]] | |||
# [[Tools/Manuals/TS62|Globus error 3]] | |||
# [[Tools/Manuals/TS63|submit-helper script ... gave error: cache export dir ...]] | |||
# [[Tools/Manuals/TS64|8 the user cancelled the job]] | |||
# [[Tools/Manuals/TS65|43 the job manager failed to stage the executable]] | |||
# [[Tools/Manuals/TS66|Globus error 17: the job failed when the job manager attempted to run it]] | |||
# [[Tools/Manuals/TS67|Globus error 21: the job manager failed to locate an internal script argument file]] | |||
# [[Tools/Manuals/TS68|Globus error 22: the job manager failed to create an internal script argument file]] | |||
# [[Tools/Manuals/TS69|Globus error 24: the job manager detected an invalid script response]] | |||
# [[Tools/Manuals/TS70|Globus error 25: the job manager detected an invalid script status]] | |||
# [[Tools/Manuals/TS71|Globus error 79: connecting to the job manager failed.]] | |||
# [[Tools/Manuals/TS72|Globus error 94: the jobmanager does not accept any new requests (shutting down)]] | |||
# [[Tools/Manuals/TS73|Globus error 155: the job manager could not stage out a file]] | |||
# [[Tools/Manuals/TS74|Globus error 158: the job manager could not lock the state lock file]] | |||
# [[Tools/Manuals/TS75|Unspecified gridmanager error]] | |||
# [[Tools/Manuals/TS76|Job got an error while in the CondorG queue]] | |||
# [[Tools/Manuals/TS77|GRAM Job submission failed because the job manager failed to open stderr (error code 74)]] | |||
# [[Tools/Manuals/TS78|MPI. ssh: connect to host <hname> port 22: No route to host]] | |||
# [[Tools/Manuals/TS79|Generic verification error for VOMS (failure)!]] | |||
# [[Tools/Manuals/TS80|globus-job-run returns nothing]] | |||
# [[Tools/Manuals/TS81|Cannot take token!]] | |||
# [[Tools/Manuals/TS82|JS always fails with 'user proxy expired' message]] | |||
# [[Tools/Manuals/TS83|lcgpbs job manager cancels all jobs]] | |||
# [[Tools/Manuals/TS84|Lots of <defunct> processes from globus-gma]] | |||
== Data Management == | == Data Management == |
Revision as of 11:14, 25 May 2011
Main | EGI.eu operations services | Support | Documentation | Tools | Activities | Performance | Technology | Catch-all Services | Resource Allocation | Security |
Tools menu: | • Main page | • Instructions for developers | • AAI Proxy | • Accounting Portal | • Accounting Repository | • AppDB | • ARGO | • GGUS | • GOCDB |
• Message brokers | • Licenses | • OTAGs | • Operations Portal | • Perun | • EGI Collaboration tools | • LToS | • EGI Workload Manager |
Troubleshooting Guide about Operational Errors on LCG Sites
Authentication
Problem with host certificate (expired, etc.) or with authentication.
- TS01: 7 authentication failed
- TS02: 530 530 LCMAPS credential mapping NOT successful
- TS03: 530 530 No local mapping for Globus ID
- TS04: 530-Login incorrect
- TS05: Proxy expired
- TS06: 501 501-FTPD GSSAPI error: GSS Major Status: General failure
- TS07: 535 535-FTPD GSSAPI error: GSS Major Status: General failure
- TS08: Invalid CRL: The available CRL has expired
- TS09: Certificate proxy not yet valid
- TS10: sslv3 alert bad certificate
- TS11: GRAM Authentication test failure:
- TS12: No valid credential found ... Bad magic number
- TS13: Generic verification error for VOMS (failure)!
- TS14: Host certificate update
- TS15: failed unwrapping ENC message
- TS16: failed unwrapping MIC message
- TS17: gss_unwrap: internal problem with SSL BIO
- TS18: no passphrase authentication failed
Workload Management
A job submitted via WMS/Condor-G/RB to an LCG-CE failed.
- 10 data transfer to the server failed
- Cannot read JobWrapper output...
- Cannot download .BrokerInfo
- BrokerHelper: no compatible resources
- request expired
- Jobs sent to my WMS stay in Waiting state forever
- Jobs sent to some CE stay in Ready state forever
- Jobs sent to some CE stay in Scheduled state forever
- Jobs sent to some CE stay in Running state forever
- 444444 waiting jobs
- ssh problem from WN to CE
- Proxy expired
- Globus error 3
- submit-helper script ... gave error: cache export dir ...
- 8 the user cancelled the job
- 43 the job manager failed to stage the executable
- Globus error 17: the job failed when the job manager attempted to run it
- Globus error 21: the job manager failed to locate an internal script argument file
- Globus error 22: the job manager failed to create an internal script argument file
- Globus error 24: the job manager detected an invalid script response
- Globus error 25: the job manager detected an invalid script status
- Globus error 79: connecting to the job manager failed.
- Globus error 94: the jobmanager does not accept any new requests (shutting down)
- Globus error 155: the job manager could not stage out a file
- Globus error 158: the job manager could not lock the state lock file
- Unspecified gridmanager error
- Job got an error while in the CondorG queue
- GRAM Job submission failed because the job manager failed to open stderr (error code 74)
- MPI. ssh: connect to host <hname> port 22: No route to host
- Generic verification error for VOMS (failure)!
- globus-job-run returns nothing
- Cannot take token!
- JS always fails with 'user proxy expired' message
- lcgpbs job manager cancels all jobs
- Lots of <defunct> processes from globus-gma
Data Management
A Data Management command failed.
- TS21: lcg cr: Invalid argument
- TS22: 425 425 Can't open data connection. timed out() failed.
- TS23: gridftp works only once within a minute or so
- TS24: LFC and DPM troubleshooting page
- TS12: No valid credential found ... Bad magic number
- TS26: No valid credential found ... System error
- TS27: Could not establish context
- TS28: Transport endpoint is not connected
- TS29: Unknown error ... Communication error on send