ROC SAM Tests

From EGIWiki
Jump to: navigation, search
Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Tools menu: Main page Instructions for developers AAI Proxy Accounting Portal Accounting Repository AppDB ARGO GGUS GOCDB
Message brokers Licenses OTAGs Operations Portal Perun EGI Collaboration tools LToS EGI Workload Manager


Contents


Alert.png This article is Deprecated and should no longer be used, but is still available for reasons of reference.

IMPORTANT: Description of metrics is not maintained on this page anymore. Please use POEM directly to find relevant information:

Obsolete

This table lists tests used for monitoring all EGI services; applied on all NGI SAM Nagioses. POEM profile used on these instances is ROC.

Service Type Metric Name Package GGUS SU Description
APEL

Tests run centrally in the APEL server and query the central APEL database for publishing details of every production/certified site registered in GOCDB with at least one CE defined as an "APEL" service type

org.apel.APEL-Pub
N/A APEL The APEL-Pub test checks for the latest successfull published date for each site. https://wiki.egi.eu/wiki/APEL/Tests#APEL-Pub
org.apel.APEL-Sync
N/A APEL The APEL-Sync test compares the number of records in the site's local database with the number of records published to the central APEL database by month. https://wiki.egi.eu/wiki/APEL/Tests#APEL-Sync
ARC-CE
org.nordugrid.ARC-CE-ARIS
nordugrid-arc-nagios-plugins DMSU http://git.nbi.ku.dk/downloads/NorduGridARCNagiosPlugins/arcinfosys.html#ce-infosys-validation-for-the-nordugrid-and-glue-1-schemas
org.nordugrid.ARC-CE-IGTF
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-LFC-result
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-LFC-submit
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-SRM-result
nordugrid-arc-nagios-plugins DMSU http://git.nbi.ku.dk/downloads/NorduGridARCNagiosPlugins/gridstorage.html
org.nordugrid.ARC-CE-SRM-submit
nordugrid-arc-nagios-plugins DMSU http://git.nbi.ku.dk/downloads/NorduGridARCNagiosPlugins/gridstorage.html
org.nordugrid.ARC-CE-lfc
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-result
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-srm
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-submit
nordugrid-arc-nagios-plugins DMSU http://git.nbi.ku.dk/downloads/NorduGridARCNagiosPlugins/arcce.html
org.nordugrid.ARC-CE-sw-csh
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-sw-gcc
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-sw-perl
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
org.nordugrid.ARC-CE-sw-python
nordugrid-arc-nagios-plugins DMSU http://wiki.nordugrid.org/wiki/Service_Monitoring#Configuration_and_deployment_of_the_ARC_probes
CREAM-CE
emi.cream.CREAMCE-AllowedSubmission
emi-cream-nagios DMSU https://wiki.italiangrid.it/twiki/bin/view/CREAM/DjsCreamProbeNew#cream_allowedSubmission
emi.cream.CREAMCE-JobCancel
emi-cream-nagios DMSU https://wiki.italiangrid.it/twiki/bin/view/CREAM/DjsCreamProbeNew#cream_jobCancel_py
emi.cream.CREAMCE-JobPurge
emi-cream-nagios DMSU https://wiki.italiangrid.it/twiki/bin/view/CREAM/DjsCreamProbeNew#cream_jobPurge_py
emi.cream.CREAMCE-ServiceInfo
emi-cream-nagios DMSU https://wiki.italiangrid.it/twiki/bin/view/CREAM/DjsCreamProbeNew#cream_serviceInfo_py
eu.egi.CREAM-IGTF
nagios-plugins-igtf ARGO Probe check CA distribution installed at TLS-enabled endpoint by retrieving the list of CA DNs available. List of available CA DNs is compared to the valid list (http://repository.egi.eu/sw/production/cas/1/current/meta/ca-policy-egi-core.subjectdn) and obsolete list (http://repository.egi.eu/sw/production/cas/1/current/meta/ca-policy-egi-core.obsoleted-subjectdn) both for current and previous official versions. In case previous version is detected probe will return CRITICAL if the official version is older than 8 days and WARNING if the official version is older than 3 days. In case discrepancies are found with the both previous and current official versions, probe will return CRITICAL. In case endpoint is not TLS-enabled probe will return OK.
hr.srce.CREAMCE-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
Central-LFC
ch.cern.LFC-Ping
nagios-plugins-lfc ARGO Ping LFC service (service level ping).
ch.cern.LFC-Read
nagios-plugins-lfc ARGO Test if an entry in the catalog can be read.
ch.cern.LFC-Write
nagios-plugins-lfc ARGO Test if the modification time of an entry in the catalog can be updated.
org.sam.LFC-Cleanup
nagios-plugins-lfc ARGO Clean test area on LFC
FTS
hr.srce.FTS-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
GRAM5
hr.srce.GRAM-Auth
nagios-plugins-globus ARGO https://github.com/ARGOeu/nagios-plugins-globus
hr.srce.GRAM-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
hr.srce.GRAM-Command
nagios-plugins-globus ARGO https://github.com/ARGOeu/nagios-plugins-globus
LB
hr.srce.LB-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
Local-LFC
ch.cern.LFC-Ping
nagios-plugins-lfc ARGO Ping LFC service (service level ping).
ch.cern.LFC-Read
nagios-plugins-lfc ARGO Test if an entry in the catalog can be read.
MyProxy
hr.srce.MyProxy-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
hr.srce.MyProxy-Store
nagios-plugins-globus ARGO https://github.com/ARGOeu/nagios-plugins-globus
QCG.Broker
hr.srce.QCG-Broker-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
pl.plgrid.QCG-Broker
qcg-broker-nagios-probe DMSU http://www.qoscosgrid.org/trac/qcg-broker/wiki/NagiosProbes
QCG.Computing
hr.srce.QCG-Computing-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
pl.plgrid.QCG-Computing
qcg-comp-nagios-probe DMSU http://www.qoscosgrid.org/trac/qcg-computing/wiki/NagiosProbes
QCG.Notification
pl.plgrid.QCG-Notification
qcg-ntf-nagios-probe DMSU http://www.qoscosgrid.org/trac/qcg-notification/wiki/NagiosProbes
SRMv2
hr.srce.SRM2-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
org.sam.SRM-All
emi.dcache.srm-probes DMSU http://argoeu.github.io/samdoc/confluence/display/SAMDOC/SRM.html
org.sam.SRM-Del
emi.dcache.srm-probes DMSU http://argoeu.github.io/samdoc/confluence/display/SAMDOC/SRM.html
org.sam.SRM-Get
emi.dcache.srm-probes DMSU http://argoeu.github.io/samdoc/confluence/display/SAMDOC/SRM.html
org.sam.SRM-GetTURLs
emi.dcache.srm-probes DMSU http://argoeu.github.io/samdoc/confluence/display/SAMDOC/SRM.html
org.sam.SRM-Ls
emi.dcache.srm-probes DMSU http://argoeu.github.io/samdoc/confluence/display/SAMDOC/SRM.html
org.sam.SRM-LsDir
emi.dcache.srm-probes DMSU http://argoeu.github.io/samdoc/confluence/display/SAMDOC/SRM.html
org.sam.SRM-Put
emi.dcache.srm-probes DMSU http://argoeu.github.io/samdoc/confluence/display/SAMDOC/SRM.html
Site-BDII
org.bdii.Entries
nagios-plugins-bdii DMSU
org.bdii.Freshness
nagios-plugins-bdii DMSU
org.bdii.GLUE2-Validate
glue-validator DMSU http://gridinfo.web.cern.ch/glue/glue-validator-guide
https://twiki.cern.ch/twiki/bin/view/EGEE/GLUEValidatorErrorCodes
org.nagios.BDII-Check
nagios-plugins-ldap N/A http://nagiosplugins.org/man/check_ldap
Top-BDII
org.bdii.Entries
nagios-plugins-bdii DMSU
org.bdii.Freshness
nagios-plugins-bdii DMSU
org.nagios.BDII-Check
nagios-plugins-ldap N/A http://nagiosplugins.org/man/check_ldap
VOMS
hr.srce.VOMS-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
WMS
emi.wms.WMS-JobSubmit
emi-wms-nagios DMSU https://tomtools.cern.ch/confluence/display/SAM/WMS
hr.srce.WMProxy-CertLifetime
nagios-plugins-cert ARGO https://github.com/ARGOeu/nagios-plugins-cert
org.nagios.GridFTP-Check
nagios-plugins-ftp N/A http://nagiosplugins.org/man/check_ftp
dg.ARC-CE
dg.FinishedJobs N/A DMSU http://www.edgi-grid.eu/downloads/EGI-EDGI-Monitoring/egi-edgi-monitoring-documentation.txt
dg.CREAM-CE
dg.FinishedJobs N/A DMSU http://www.edgi-grid.eu/downloads/EGI-EDGI-Monitoring/egi-edgi-monitoring-documentation.txt
dg.TargetSystemFactory
dg.FinishedJobs N/A DMSU http://www.edgi-grid.eu/downloads/EGI-EDGI-Monitoring/egi-edgi-monitoring-documentation.txt
eu.egi.MPI
eu.egi.mpi.EnvSanityCheck (removed)
N/A N/A https://wiki.egi.eu/wiki/VT_MPI_within_EGI:Nagios
eu.egi.mpi.complexjob.CREAMCE-JobSubmit (removed)
N/A N/A https://wiki.egi.eu/wiki/VT_MPI_within_EGI:Nagios
eu.egi.mpi.complexjob.WN (removed)
N/A N/A https://wiki.egi.eu/wiki/VT_MPI_within_EGI:Nagios
eu.egi.mpi.simplejob.CREAMCE-JobSubmit (removed)
N/A N/A https://wiki.egi.eu/wiki/VT_MPI_within_EGI:Nagios
eu.egi.mpi.simplejob.WN (removed)
N/A N/A https://wiki.egi.eu/wiki/VT_MPI_within_EGI:Nagios
globus-GRIDFTP
hr.srce.GridFTP-Transfer
nagios-plugins-globus ARGO https://github.com/ARGOeu/nagios-plugins-globus
org.nagios.GridFTP-Check
nagios-plugins-ftp N/A http://nagiosplugins.org/man/check_ftp
globus-GSISSHD
org.nagios.gsissh-Check
nagios-plugins-ssh N/A http://nagiosplugins.org/man/check_ssh
unicore6.Gateway
emi.unicore.Gateway
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_gateway/check_gateway.html
unicore6.Registry
emi.unicore.Registry
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_registry/check_registry.html
unicore6.ServiceOrchestrator
emi.unicore.ServiceOrchestrator
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_servorch/check_servorch.html
unicore6.StorageFactory
emi.unicore.StorageFactory
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_storagefactory/check_storagefactory.html
unicore6.StorageManagement
emi.unicore.GlobalStorage
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_sms/check_sms.html
emi.unicore.GlobalStorage-FreeSpace
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_freespace/check_freespace.html
unicore6.TargetSystemFactory
emi.unicore.TargetSystemFactory
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_unicorex/check_unicorex.html
emi.unicore.UNICORE-Job
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_application/check_application.html
unicore6.UVOSAssertionQueryService
emi.unicore.UVOS
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_uvos/check_uvos.html
unicore6.WorkflowFactory
emi.unicore.WorkflowService
unicore-nagios-plugins DMSU http://unicore-life.svn.sourceforge.net/viewvc/unicore-life/monitoring/UMI-Probes/trunk/umi2/check_workflowservice/check_workflowservice.html
OPS tools
org.nagios.CertLifetime
nagios-plugins-http N/A https://www.monitoring-plugins.org/doc/man/check_http.html
/usr/lib64/nagios/plugins/check_http -H HOSTNAME -t 60 -J NAGIOSKEY -K NAGIOSCERT --sni -C 30
lowAvailability
egi.eu.lowAvailability
local script / Operations Portal Operations Portal Script querying ARGO and raising a fake alarm in case of low results (Availability or Reliability - average values for the last 30 days )