The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other platforms; new updates will be ignored and lost.
If needed, you can get in touch with the EGI SDIS team at operations@egi.eu.

Difference between revisions of "PROC10 Recomputation of SAM results or availability reliability statistics"

{{Template:Op menubar}} {{Template:Doc_menubar}} {{TOC_right}}  
 
[[Category:Deprecated]]
{{Ops_procedures
{| style="border:1px solid black; background-color:lightgrey; color: black; padding:5px; font-size:140%; width: 90%; margin: auto;"
|Doc_title = Recomputation of availability/reliability statistics
| style="padding-right: 15px; padding-left: 15px;" |
|Doc_link = [[PROC10|https://wiki.egi.eu/wiki/PROC10]]
|[[File:Alert.png]] This page is '''Deprecated'''; the content has been moved to https://confluence.egi.eu/display/EGIPP/PROC10+Recomputation+of+SAM+results+or+availability+reliability+statistics 
|Version = 08.06.2016
|Policy_acronym = OMB
|Policy_name = Operations Management Board
|Contact_group = operations@egi.eu
|Doc_status = Approved
|Approval_date = 29.10.2015
|Procedure_statement = This procedure documents the steps for requesting a correction in the SAM test results and in the related availability/reliability statistics.
|Owner = Alessandro Paolini
}}
 
<br>
 
= Overview  =
 
This procedure documents the steps for requesting a correction in the OPS VO [https://wiki.egi.eu/wiki/SAM_Instances SAM&nbsp;test results] and in the related [https://wiki.egi.eu/wiki/Availability_and_reliability_monthly_statistics availability/reliability statistics] if applicable.
 
Figures are available through the web interface: http://argo.egi.eu
 
DISCLAIMER: This procedure applies only to EGI OPS test results. The computation of VO-specific availability reports follows VO-specific procedures and is out of the scope of this document.
 
= Definitions  =
 
Please refer to the [[Glossary|EGI Glossary]] for the definitions of the terms used in this procedure.
 
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
 
= Who can submit a request?  =
 
Re-computations can be requested by:  
 
*site administrators
*regional operations staff.
 
= Re-computation policy  =
 
Starting from 1 May 2012:
 
*'''Monitoring results can be recomputed only in case of problems with the monitoring infrastructure itself.'''
*'''No re-computations will be performed in case of issues with the deployed middleware''' (e.g. documented bugs affecting the availability of a production service end-point); such issues will consequently be reflected in lower availability/reliability.
 
<br>
 
Some examples of possible issues justifying a re-computation request:
 
*invalid proxy certificate used for submitting the monitoring probes in a Nagios instance;  
*problems with the Storage Element used for replica management tests, resulting in errors on CE metrics.
 
<br>
 
'''Deadline:''' 10 calendar days after the publication and announcement of the monthly Availability/Reliability reports for a given month X (typically the announcement is distributed on the 1st day of month X+1).

Based on the re-computation requests received, A/R reports will be regenerated only once for each month, after the 10th of month X+1.
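
The deadline arithmetic can be sketched in shell. This is only an illustration, assuming GNU <tt>date</tt> (the same tool used in the Tips section) and reading the deadline literally as "announcement day plus ten days"; the function name <tt>recomputation_deadline</tt> and the example date are hypothetical and not part of any EGI or ARGO tooling.

```shell
#!/bin/sh
# Hypothetical helper: print the last day on which a re-computation
# request can be submitted, taken here as 10 calendar days after the
# day the monthly A/R report was announced. Assumes GNU date.
recomputation_deadline() {
    announced="$1"   # announcement day, e.g. 2012-03-01
    # Work in UTC and in epoch seconds so that daylight-saving
    # changes cannot shift the result.
    start=$(date --utc --date="$announced" +%s) || return 1
    date --utc --date="@$((start + 10 * 86400))" +%Y-%m-%d
}

recomputation_deadline 2012-03-01   # prints 2012-03-11
```

For a report announced on the 1st of month X+1, this gives the 11th of that month as the last day for requests.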
 
= How to request a re-computation of OPS monitoring results  =
 
{| class="wikitable"
|-
! Step
! Action on
! Action
|-
| 1
| Site administrator / ROD team
|
As soon as the problem is detected, please fill in this form: [http://argo.egi.eu/lavoisier/recomputation http://argo.egi.eu/lavoisier/recomputation] and indicate:<br>

#your name and email
#the site(s) affected by the problem
#a description of the problem
#the profile affected
#the start and end time of the problem (including day and hour in UTC)<br>

Submitting the form notifies the ARGO / EGI Operations team, and your request is placed in pending status.
 
|-
| 2
| ARGO&nbsp;team / EGI Operations team
|
A member of staff validates the request. <br>

You will be informed by email whether the request is confirmed or rejected.
 
|-
| 3
| ARGO&nbsp;team / EGI Operations team
|
If the request is accepted, the recomputation will be triggered as soon as possible.

The status of the recomputation will be visible through a web page (the link is given in the email from the previous step).
 
|}
|}
<br>
= Tips  =
*Date formats
You can use the <tt>date</tt> command (GNU coreutils) to convert the start and end time from your time zone to <tt>UTC</tt> in [http://en.wikipedia.org/wiki/ISO_8601 ISO 8601] format.

''The start time must be rounded down to the full hour and the end time rounded up to the next full hour.''

Example:
 date --date="12 Feb 2012 17:00 CET" --utc --iso-8601=minutes
will give:
 2012-02-12T16:00+0000
(recent GNU coreutils versions print the offset as <tt>+00:00</tt>).
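
The rounding rule can also be scripted. The following is a minimal sketch, assuming GNU <tt>date</tt>; the function names <tt>round_start_utc</tt> and <tt>round_end_utc</tt> and the sample timestamps are hypothetical and only illustrate the conversion.

```shell
#!/bin/sh
# Hypothetical helpers (assume GNU date): convert a local timestamp to
# UTC and round it to a whole hour, as requested for the form fields.

# Round the start time down to the beginning of its UTC hour.
round_start_utc() {
    epoch=$(date --date="$1" +%s) || return 1
    date --utc --date="@$(( epoch / 3600 * 3600 ))" +%Y-%m-%dT%H:%M%z
}

# Round the end time up to the next full UTC hour
# (a time already on the hour is left unchanged).
round_end_utc() {
    epoch=$(date --date="$1" +%s) || return 1
    date --utc --date="@$(( (epoch + 3599) / 3600 * 3600 ))" +%Y-%m-%dT%H:%M%z
}

round_start_utc "12 Feb 2012 17:25 CET"   # prints 2012-02-12T16:00+0000
round_end_utc   "12 Feb 2012 19:10 CET"   # prints 2012-02-12T19:00+0000
```

Working in epoch seconds keeps the rounding aligned to UTC hours even when the local time zone has an offset that is not a whole number of hours.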
= Revision history  =
{| class="wikitable"
|-
! Version
! Authors
! Date
! Comments
|-
| <br>
| George Fergadis/AUTH
| 03/05/2012
| updated policy and procedure to reflect the OMB decision of the March 2012 meeting
|-
| <br>
| George Fergadis/AUTH
| 17/01/2012
| the text of the procedure is fixed to clarify that both RC administrators and regional operations staff can request a re-computation.
|-
| <br>
| George Fergadis/AUTH
| 16/01/2012
| the text of the procedure is fixed to clarify that the recomputation of test results can be requested before the end of the affected month, in which case if sufficient time is allowed for fixing of the test results, no re-computation of availability/reliability statistics will be needed.
|-
| <br>
| M. Krakowian
| 19 August 2014
| Change contact group -&gt; Operations support
|-
|
| M. Krakowian
| 3.11.2014
|
Add step 2: Note: ''It is recommended to open a [https://wiki.egi.eu/wiki/PROC10 ticket for recomputation] as soon as the problem has been detected.''
|-
|
| C. L'Orphelin
| 28.09.2015
| ARGO&nbsp;procedure: a ticket is no longer needed; it is replaced by a recomputation form
|-
|
| Alessandro Paolini
| 2016-06-08
| Changed contact group -&gt; Operations
|}
[[Category:Operations_Procedures]]

Latest revision as of 09:43, 15 April 2022