The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other platforms; new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "SAM Tests"

From EGIWiki
Line 1: Line 1:
== Introduction ==
== Introduction ==


There are three set of probes:
There are three set of tests:
* Probes executed on SAM/Nagios NGI/ROC instances
* SAM tests: tests executed on NGI/ROC SAM instances
* Probes which raise alarms in the Operations Portal (ROD alarms)
* Operations tests: tests which raise alarms in the Operations Portal
* Probes which are used for availability calculation by the GridView
* Availability&reliability tests: tests which are used for availability calculation by the GridView


These sets are different as they are used for different purposes. Below are description of probes and procedures for adding new probes.
These sets are different as they are used for different purposes. Below are description of tests and procedures for adding new tests.


== SAM/Nagios NGI/ROC instances ==
== SAM tests ==


Probes on SAM/Nagios NGI/ROC instances are the one which frameworks includes in the Nagios configuration. In addition Nagios admins can add their own probes to these instances.  
Tests on NGI/ROC SAM instances are the one which frameworks includes in the SAM configuration. In addition SAM admins can add their own probes to these instances.  


SAM/Nagios teams proposes addition of new probes. The addition of probes is part of SAM/Nagios release and thus part of the staged rollout. It was agreed that prior to release new list of probes will be briefly presented at the OMB meeting.
SAM teams proposes addition of new probes. The addition of probes is part of SAM release and thus part of the staged rollout. It was agreed that prior to release new list of probes will be briefly presented at the OMB meeting. Probes which perform internal components of SAM are not presented at OMB.


== Operations Portal ==
== Operations tests ==


Probes on Operation Portal are the ones used for raising alarms for ROD and COD teams. Operations portal does not execute these probes, but rather receives alarms from NGI/ROC Nagioses. Operations Portal contains list of the probes used for alarms and others are filtered.
Tests on Operations Portal are the ones used for raising alarms for ROD and COD teams. Operations portal does not execute these tests, but receives alarms from NGI/ROC SAM instances. Operations Portal contains list of the probes used for alarms and others are filtered.


The procedure for adding a new probe can be found [[Operations:Procedure_for_setting_Nagios_test_an_operations_test|here]].
The procedure for adding a new probe can be found [[Operations:Procedure_for_setting_Nagios_test_an_operations_test|here]].
Line 22: Line 22:
The list of tests can be found [[Operations:Operations_tests|here]].
The list of tests can be found [[Operations:Operations_tests|here]].


== Availability calculation ==
== Availability&reliability tests ==


Set of probes is used for calculating availability of sites and services. The availability calculation is related to the OLA. As in case of Operations Portal, availability calculation component receives results from NGI/ROC Nagioses.  
Set of tests used for calculating availability and reliability of sites and services. The A/R calculation is related to the OLA. As in case of Operations Portal, availability calculation component receives results from NGI/ROC SAM instances.  


TSA1.8 proposes a change in avail calculation (which probe results count in it) and the OMB approves.
TSA1.8 proposes a change in avail calculation (which probe results count in it) and the OMB approves.


The list of tests can be found [[Availability and reliability tests|here]].
The list of tests can be found [[Availability and reliability tests|here]].

Revision as of 18:33, 14 December 2010

Introduction

There are three sets of tests:

  • SAM tests: tests executed on NGI/ROC SAM instances
  • Operations tests: tests which raise alarms in the Operations Portal
  • Availability&reliability tests: tests which are used by GridView for the availability calculation

These sets differ because they serve different purposes. The sections below describe each set and the procedure for adding new tests to it.

SAM tests

Tests on NGI/ROC SAM instances are the ones which the framework includes in the SAM configuration. In addition, SAM admins can add their own probes to these instances.

The SAM team proposes the addition of new probes. Adding probes is part of a SAM release and is therefore part of the staged rollout. It was agreed that, prior to a release, the new list of probes will be briefly presented at the OMB meeting. Probes which check internal components of SAM are not presented at the OMB.

Operations tests

Tests on the Operations Portal are the ones used for raising alarms for ROD and COD teams. The Operations Portal does not execute these tests; it receives alarms from NGI/ROC SAM instances. The Operations Portal holds the list of probes used for alarms, and alarms from all other probes are filtered out.
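
The filtering step described above can be sketched in a few lines of Python. This is only an illustration, not the Operations Portal's actual implementation: the probe names, the site names, the record layout and the function `filter_alarms` are all hypothetical.

```python
# Hypothetical list of probes configured as operations tests.
OPERATIONS_TESTS = {"example.CE-JobSubmit", "example.SRM-Put"}


def filter_alarms(incoming, operations_tests=OPERATIONS_TESTS):
    """Keep only alarms raised by probes on the operations tests list."""
    return [alarm for alarm in incoming if alarm["probe"] in operations_tests]


# Alarms as they might arrive from NGI/ROC SAM instances (made-up data).
incoming = [
    {"site": "SITE-A", "probe": "example.CE-JobSubmit", "status": "CRITICAL"},
    {"site": "SITE-A", "probe": "example.Internal-Check", "status": "CRITICAL"},
    {"site": "SITE-B", "probe": "example.SRM-Put", "status": "WARNING"},
]

alarms = filter_alarms(incoming)
for alarm in alarms:
    print(alarm["site"], alarm["probe"], alarm["status"])
```

In this sketch the alarm from `example.Internal-Check` is dropped because that probe is not on the list, while the other two alarms are passed on to ROD and COD.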

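The procedure for adding a new probe is described on the wiki page "Operations:Procedure_for_setting_Nagios_test_an_operations_test".

The list of tests can be found on the wiki page "Operations:Operations_tests".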

Availability&reliability tests

This set of tests is used for calculating the availability and reliability (A/R) of sites and services. The A/R calculation is related to the OLA. As with the Operations Portal, the availability calculation component receives results from NGI/ROC SAM instances.

TSA1.8 proposes changes to the availability calculation (that is, which probe results count towards it), and the OMB approves them.
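
To illustrate why the choice of counted probes matters, the toy example below computes a figure from the same results with two different probe sets. It is a deliberately simplified sketch: the formula (share of counted samples with status OK) is an assumption made for illustration and is not GridView's actual algorithm, and the probe names and the function `availability` are hypothetical.

```python
def availability(results, counted_probes):
    """Share of counted samples that are OK (simplified, assumed formula).

    Returns None when no result belongs to a counted probe.
    """
    counted = [r for r in results if r["probe"] in counted_probes]
    if not counted:
        return None
    ok = sum(1 for r in counted if r["status"] == "OK")
    return ok / len(counted)


# Made-up probe results for one site.
samples = [
    {"probe": "example.CE-JobSubmit", "status": "OK"},
    {"probe": "example.CE-JobSubmit", "status": "CRITICAL"},
    {"probe": "example.SRM-Put", "status": "OK"},
    {"probe": "example.SRM-Put", "status": "OK"},
]

# Counting one probe versus counting both changes the resulting figure.
before = availability(samples, {"example.CE-JobSubmit"})
after = availability(samples, {"example.CE-JobSubmit", "example.SRM-Put"})
print(before, after)  # 0.5 0.75
```

The same raw results yield different figures depending on the counted set, which is why changes to that set go through a proposal and OMB approval.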

The list of tests can be found on the wiki page "Availability and reliability tests".