The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other platforms; new updates will be ignored and lost.
If needed, you can contact the EGI SDIS team at operations @ egi.eu.

Difference between revisions of "Operations Procedures"

From EGIWiki
(Undo revision 15490 by Iivanoc (Talk))

Revision as of 14:41, 17 May 2011


Operations

EGI Operational Procedures are prescriptive documents that describe step-by-step processes involving several partners. The purpose of a procedure is to define the related workflow. Procedures are approved by the OMB and are periodically reviewed.

{| border="1" class="wikitable sortable"
|- style="background-color: lightgray;"
| '''Number'''
| '''Status'''
| '''Area'''
| '''Relevant to'''
| '''Title'''
| '''Comment'''
|-
| [[PROC01|PROC 01]]
| ''approved'', October 26 2010
| Ticket Management
| Resource Centre Administrators, Operations Centres, COD
| [[PROC01|COD Escalation Procedure]]
| Operations ticket escalation
|-
| [[PROC02|PROC 02]]
| ''approved'', August 17 2010
| Operations Centre Management
| Operations Centres, COD
| [[PROC02|Operations Centre Creation]]
| Step-by-step instructions on how to create a new Operations Centre
|-
| [[PROC03|PROC 03]]
| ''approved'', October 26 2010
| Operations Centre Management
| Operations Centres, COD
| [[Operations:Operations Centre decommission|Operations Centre decommissioning]]
| Step-by-step instructions on how to decommission an Operations Centre
|-
| [[Availability and reliability monthly statistics#Process_for_quality_verification|PROC 04]]
| ''approved'', August 17 2010
| Availability and Monitoring
| Resource Centre Administrators, Operations Centres, COD
| [[Availability and reliability monthly statistics#Process_for_quality_verification|Quality verification of monthly availability and reliability statistics]]
| Instructions for RODs and Operations Centres on how to handle justifications for poor monthly performance through GGUS
|-
| [[PROC05|PROC 05]]
| ''approved'', August 17 2010
| Availability and Monitoring
| Operations Centres, COD
| [https://twiki.cern.ch/twiki/bin/view/EGEE/ValidateROCNagios Validation of an Operations Centre Nagios]
| This procedure is part of the [[Operations Centre creation process coordination|Operations Centre creation]] procedure.
|-
| [[PROC06|PROC 06]]
| ''approved'', Nov 23 2010
| Availability and Monitoring
| Operations Centres, COD
| [[PROC06|Setting a Nagios test status to OPERATIONS]]
| A Nagios probe is set to OPERATIONS when its results are used to generate notifications for the Operations Dashboard. This procedure details the steps to set a Nagios test to OPERATIONS.
|-
| [[PROC07|PROC 07]]
| ''approved'', Mar 28 2011
| Availability and Monitoring
| Resource Centre Administrators, Operations Centres, COD
| [[PROC07|Adding new probes to SAM]]
| Procedure for adding new OPS Nagios probes to the SAM release.
|-
| [[PROC08|PROC 08]]
| ''approved'', Mar 28 2011
| Availability and Monitoring
| Resource Centre Administrators, Operations Centres, COD
| [[PROC08|Management of the EGI OPS Availability and Reliability Profile]]
| Procedure for requesting a change to the OPS EGI Availability and Reliability profile. A change in the profile is needed whenever a Nagios test is added to or removed from the profile, so that its results are included in or excluded from the monthly Availability and Reliability statistics.
|}

Note: A Nagios probe is set to AVAILABILITY when its results are used for availability and reliability computation. The procedure to set a Nagios probe to AVAILABILITY will be finalized by TSA1.4 in QR4 2011.

Security

{| border="1" class="wikitable sortable"
|- style="background-color: lightgray;"
| '''Number'''
| '''Status'''
| '''Area'''
| '''Relevant to'''
| '''Title'''
| '''Comment'''
|-
| SEC 01
| ''approved'', July 2010 (MS405)
| Security
| Resource Centres, EGI CSIRT
| Security Incident Handling
| The "Security Incident Handling Procedure" defines site and incident coordinator responsibilities when handling Grid-related security incidents. ALL EGI sites are required to follow this procedure to report and handle Grid-related security incidents.
|-
| SEC 02
| ''approved'', July 2010 (MS405)
| Security
| Resource Centres, Risk Assessment Team, Technology Providers, SVG
| Vulnerability issue handling process
| The process used to report and resolve Grid Software vulnerabilities in the EGI Inspire project.
|-
| SEC 03
| ''approved'', March 15 2011
| Security
| Resource Centres, Operations Centres, EGI-CSIRT, SVG
| EGI-CSIRT Critical Vulnerability Handling
| Once a problem has been assessed as critical and a solution is available, sites are required to take action. This document primarily defines the procedure from the point at which sites are asked to take action, including the steps taken if they do not respond or do not act. A site that fails to take action may be suspended.
|}

More information

EGI Policies and Procedures

See all EGI policies and procedures