Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Cloud SAM tests

From EGIWiki
Revision as of 16:26, 24 October 2014 by Eimamagi (talk | contribs)
Jump to navigation Jump to search
Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Tools menu: Main page Instructions for developers AAI Proxy Accounting Portal Accounting Repository AppDB ARGO GGUS GOCDB
Message brokers Licenses OTAGs Operations Portal Perun EGI Collaboration tools LToS EGI Workload Manager


This table lists tests used for monitoring fedcloud resources tools. Tests are executed on the central SAM instance:

Alarms for these tests are opened directly in the Operations Portal Dashboard. POEM profile used on this instance is CLOUD-MON.

Nagios test Frequency Description
eu.egi.cloud.APEL-Pub 12 hours Check looks at the http://goc-accounting.grid-support.ac.uk/cloudtest/cloudsites2.html and checks if the site is there. It also checks lastupdate field and raise:
  • WARNING: if lastupdate is older than 7 days
  • CRITICAL: if lastupdate is older than 30 days

When searching the web site probe uses name provided in the URL in GOCDB entry.

eu.egi.cloud.AppDB-Update 15 minutes Probe gets the hv:imagelist.dc:date:created from the image list (https://vmcaster.appdb.egi.eu/store/vappliance/fedcloud.monitoring.va/image.list) and compares it with the current UTC time:
  • If date_diff > 12h => error
  • else if 6h < date_diff < 12h => warning
  • else => ok
eu.egi.cloud.OCCI-VM 1 h Probe uses OCCI interface to create VM, waits for the VM to become active and then destroys it. In order for the probe to work properly sites need to provide information in the GOCDB URL. URL format is defined here.
eu.egi.cloud.Perun-Check 5 minutes Probe connects to Perun and checks the status by using internal Perun interface.
org.nagios.Broker-TCP 15 min Checks if Broker port (defined in GOCDB URL) is open. Additional documentation: http://nagiosplugins.org/man/check_tcp.
org.nagios.CDMI-TCP 15 min Checks if CDMI port (defined in GOCDB URL) is open. Additional documentation: http://nagiosplugins.org/man/check_tcp.
org.nagios.CloudBDII-Check 15 min Checks if Cloud BDII is running by using -b o=glue and LDAP version 3. Additional documentation: http://nagiosplugins.org/man/check_ldap.
org.nagios.OCCI-TCP 15 min Checks if OCCI port (defined in GOCDB URL) is open. Additional documentation: http://nagiosplugins.org/man/check_tcp.