The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other platforms; new updates will be ignored and lost.
If needed, you can get in touch with the EGI SDIS team at operations@egi.eu.

Difference between revisions of "Operations Procedures"


Revision as of 14:10, 18 June 2013

Operations

EGI Operational Procedures are prescriptive documents that describe step-by-step processes involving several partners. The purpose of a procedure is to define the related workflow. Procedures are approved by the OMB and are periodically reviewed.

{| border="1" class="wikitable sortable"
|- style="background-color: lightgray;"
| '''Number''' || '''Title''' || '''Comment''' || '''Area''' || '''Relevant to''' || '''Last update'''
|-
| [[PROC01|PROC 01]] || [[PROC01|Grid Oversight Escalation]] || Operations ticket escalation || Ticket Management || Resource Centre Administrators, Operations Centres, COD || 20.11.2012
|-
| [[PROC02|PROC 02]] || [[PROC02|Operations Centre Creation]] || Step-by-step instructions on how to create a new Operations Centre || Operations Centre Management || Operations Centres, COD || 15.11.2011
|-
| [[PROC03|PROC 03]] || [[PROC03|Operations Centre decommissioning]] || Step-by-step instructions on how to decommission an Operations Centre || Operations Centre Management || Operations Centres, COD || 22.10.2010
|-
| [[PROC04|PROC 04]] || [[PROC04|Quality verification of monthly availability and reliability statistics]] || Instructions for RODs and Operations Centres on how to handle justifications for poor monthly performance || Availability and Monitoring || Resource Centre Administrators, Operations Centres, COD || 30.10.2012
|-
| [[PROC05|PROC 05]] || [[PROC05|Validation of Operations Centre Nagios]] || This procedure is part of the [[PROC02|Operations Centre creation]] procedure. || Availability and Monitoring || Operations Centres, COD || 19.01.2012
|-
| [[PROC06|PROC 06]] || [[PROC06|Setting a Nagios test status to OPERATIONS]] || A Nagios probe is set to OPERATIONS when its results are used to generate notifications for the Operations Dashboard. This procedure details the steps to turn a Nagios test to OPERATIONS. || Availability and Monitoring || Operations Centres, COD || 23.11.2010
|-
| [[PROC07|PROC 07]] || [[PROC07|Adding new probes to SAM]] || Addition of new OPS Nagios probes to the SAM release. || Availability and Monitoring || Resource Centre Administrators, Operations Centres, COD || 16.03.2011
|-
| [[PROC08|PROC 08]] || [[PROC08|Management of the EGI OPS Availability and Reliability Profile]] || Request for a change to the OPS EGI Availability and Reliability profile. A change in the profile is needed every time a Nagios test needs to be added to or removed from the profile, so that its results are included in or excluded from the monthly Availability and Reliability statistics. || Availability and Monitoring || Resource Centre Administrators, Operations Centres, COD || 16.03.2011
|-
| [[PROC09|PROC 09]] || [[PROC09|Resource Centre Registration and Certification]] || Registration of a new Resource Centre in the GOCDB || Resource Centre Management || Resource Centre Administrator, Operations Centres || 15.10.2012
|-
| [[PROC10|PROC 10]] || [[PROC10|Recomputation of monitoring results and availability statistics]] || Notification of problems with the monitoring results gathered by SAM, and requests for recomputation of results and the related availability and reliability statistics || Availability and Monitoring || Resource Centre Administrators, Operations Centres || 27.09.2012
|-
| [[PROC11|PROC 11]] || [[PROC11|Resource Centre Decommissioning]] || Decommissioning of a Resource Centre before it is turned into CLOSED in GOCDB || Resource Centre Management || Resource Centre Administrator, Operations Centres || 15.03.2012
|-
| [[PROC12|PROC 12]] || [[PROC12|Production Service Decommissioning]] || Decommissioning of an EGI production service || Resource Centre Management || Resource Centre Administrator, Operations Centres || 24.09.2012
|-
| [[PROC13|PROC 13]] || [[PROC13|VO Deregistration]] || Decommissioning of a Virtual Organization supported by the European Grid Infrastructure || VO Management || VO Managers, Operations Manager || 16.07.2012
|-
| [[PROC14|PROC 14]] || [[PROC14|VO Registration]] || Registration of a Virtual Organization to the European Grid Infrastructure || VO Management || VO Managers, Operations Manager || 30.10.2012
|-
| [[PROC15|PROC 15]] || [[PROC15|Resource Center renaming]] || A procedure for changing the name of a Resource Center. || Resource Centre Management || Resource Centre Administrator, Operations Centres || 30.10.2012
|-
| [[PROC16|PROC 16]] || [[PROC16|Decommissioning of unsupported software]] || A procedure for removal of unsupported software from the production infrastructure. || Resource Centre Management || Resource Centre Administrator, Operations Centres || 20.11.2012
|-
| [[PROC17|PROC 17]] || [[PROC17|Decommissioning of service type]] || A procedure for removal of a service type from the production infrastructure. || Resource Centre Management || Resource Centre Administrator, Operations Centres || 18.06.2013
|}

Structure template for new procedures

Security

{| border="1" class="wikitable sortable"
|- style="background-color: lightgray;"
| '''Number''' || '''Title''' || '''Comment''' || '''Status''' || '''Area''' || '''Relevant to'''
|-
| SEC 01 || EGI Security Incident Handling || The "Security Incident Handling Procedure" defines site and incident coordinator responsibilities when handling Grid-related security incidents. All EGI sites are required to follow this procedure to report and handle Grid-related security incidents. || approved, July 2010 (MS405) || Security || Resource Centres, EGI CSIRT
|-
| SEC 02 || EGI Vulnerability issue handling process || The process used to report and resolve Grid software vulnerabilities in the EGI-InSPIRE project. || approved, July 2010 (MS405) || Security || Resource Centres, Risk Assessment Team, Technology Providers, SVG
|-
| SEC 03 || Critical Vulnerability Operational Procedure || Once a problem has been assessed as critical and a solution is available, sites are required to take action. This document primarily defines the procedure from the point at which sites are asked to take action, and the steps taken if they do not respond or do not act. Failure to act may lead to site suspension. || approved, March 15 2011 || Security || Resource Centres, Operations Centres, EGI-CSIRT, SVG
|}

More information

EGI Policies and Procedures

See all EGI policies and procedures