
SAM Tests

Revision as of 18:45, 19 August 2010 by Eimamagi (talk | contribs)

Introduction

There are three sets of probes:

  • Probes executed on SAM/Nagios NGI/ROC instances
  • Probes which raise alarms in the Operations Portal (ROD alarms)
  • Probes which are used for availability calculation by GridView

These sets differ because they serve different purposes. Below are descriptions of each set and the procedures for adding new probes.

SAM/Nagios NGI/ROC instances

Probes on SAM/Nagios NGI/ROC instances are the ones that the framework includes in the Nagios configuration. In addition, Nagios admins can add their own probes to these instances.

The SAM/Nagios team proposes the addition of new probes. Adding probes is part of a SAM/Nagios release and is therefore part of the staged rollout. It was agreed that, prior to each release, the new list of probes will be briefly presented at the OMB meeting.

Operations Portal

Probes on the Operations Portal are the ones used to raise alarms for the ROD and COD teams. The Operations Portal does not execute these probes itself; it receives alarms from the NGI/ROC Nagios instances. The portal holds a list of the probes used for alarms, and alarms from all other probes are filtered out.
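The filtering step can be pictured with a minimal Python sketch. It is only an illustration of the idea, not the Operations Portal's actual code: the `Alarm` record, the `filter_alarms` function, and the probe and site names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alarm:
    """Hypothetical record for an alarm received from an NGI/ROC Nagios instance."""
    site: str
    probe: str


def filter_alarms(alarms, alarm_probes):
    """Keep only alarms raised by probes on the portal's alarm list.

    `alarm_probes` stands in for the list of probes used for alarms;
    alarms from any other probe are dropped.
    """
    return [alarm for alarm in alarms if alarm.probe in alarm_probes]


# Illustrative data only: these probe and site names are made up.
received = [
    Alarm(site="SITE-A", probe="probe-on-alarm-list"),
    Alarm(site="SITE-B", probe="probe-not-on-list"),
]
kept = filter_alarms(received, alarm_probes={"probe-on-alarm-list"})
```

In this sketch only the alarm from `SITE-A` is kept, because its probe is on the alarm list.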

A proposal to make a test critical is submitted to the OMB, which approves or rejects it. The proposal can come from any partner.

Availability calculation

A set of probes is used to calculate the availability of sites and services. The availability calculation is related to the OLA. As with the Operations Portal, the availability calculation component receives results from the NGI/ROC Nagios instances.

TSA1.8 proposes changes to the availability calculation (that is, which probe results count towards it), and the OMB approves them.