Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "PROC10 Recomputation of SAM results or availability reliability statistics"

From EGIWiki
Jump to navigation Jump to search
(Remove deprecated content)
Tag: Replaced
 
(82 intermediate revisions by 8 users not shown)
Line 1: Line 1:
{{Template:Op menubar}}
{{Template:Op menubar}} {{Template:Doc_menubar}} {{TOC_right}}  
{{Template:Doc_menubar}}
[[Category:Deprecated]]
[[Category:Procedures]]
{| style="border:1px solid black; background-color:lightgrey; color: black; padding:5px; font-size:140%; width: 90%; margin: auto;"
__TOC__
| style="padding-right: 15px; padding-left: 15px;" |
 
|[[File:Alert.png]] This page is '''Deprecated'''; the content has been moved to https://confluence.egi.eu/display/EGIPP/PROC10+Recomputation+of+SAM+results+or+availability+reliability+statistics 
= Procedurerecomputation of SAM results and availability/reliability =
|}
 
*'''Title''': Recomputation of monitoring results and  availability
*'''Document link''':  
*'''Last modified''':
*'''Version''': 1.0
*'''Policy Group Acronym''': OMB
*'''Policy Group Name''': Operations Management Board
*'''Contact Person''': Dimitris Zilaskos
*'''Document Status''': DRAFT
*'''Approved Date''':
*'''Procedure Statement''':The purpose of this document is to ...
 
= Overview  =
This procedure documents the steps for requesting a correction in the
[[SAM_Instances|SAM test results]] and in the related [[Availability_and_reliability_monthly_statistics|availability statistics]].
 
= Prerequisites =
Fixes in test results are accepted only when failures in test results were due to problems
cased to the monitoring infrastructure itself. Some examples:
* invalid proxy certificate used for submitting the monitoring probes in a Nagios instance;
* problems with the Storage Element used for replica management tests resulting in errors on CE's metrics.
 
= Steps  =
 
# '''STEP 1''': notify your Operations Centre by opening a [http://helpdesk.egi.eu/ GGUS ticket] to be assigned to your Operations Centre Support Unit.
# '''STEP 2''': the Operations Centre anlayzes the request. If the request is validated, the ticket is re-assigned to the [[GGUS:SLM-FAQ|Service Level Management]](SLM) Support Unit, who will be responsible of (1) collecting all reported problems and (2) discuss the reported problems with the SAM Support Unit by re-assigning the ticket to the SAM/Nagios SU.
# '''STEP 3''': if the request for recomputation of the test results is accepted, the SAM Support Unit will be reponsible of triggering a recomputation of the monthly availability statistics.
# '''STEP 4''': when the new availability statistics are ready for distribution, the SAM/Nagios SU reassignes the ticket to the SLM Support Unit, in order to notify that a new set of reports can be re-distributed to EGI.
 
= Externar links =
* [https://tomtools.cern.ch/confluence/display/SAMDOC/Availability+Re-computation+Policy WLCG Availability re-computation policy]
 
= Revision history  =

Latest revision as of 10:43, 15 April 2022