PROC02 Operations Centre creation

From EGIWiki
(Redirected from PROC02)
Jump to: navigation, search
Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Documentation menu: Home Manuals Procedures Training Other Contact For: VO managers Administrators


Contents


Title Operations Centre creation
Document link https://wiki.egi.eu/wiki/PROC02
Last modified 19 August 2014
Policy Group Acronym OMB
Policy Group Name Operations Management Board
Contact Group operations@egi.eu
Document Status Approved
Approved Date 26.10.2010
Procedure Statement The purpose of this document is to clearly describe the actions and the relative steps to be undertaken for integrating a Operations Centre into the EGI operational structure.


Overview

The purpose of this document is to clearly describe actions and relative steps to be undertaken for integrating an Operations Centre into the EGI operational structure.

Definitions

The Integration Process Coordinator (IPC) is the entity responsible for integrating a new Operations Centre within EGI.

Please refer to the EGI Glossary for the definitions of the terms used in this procedure.

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", “MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.

Validation

First step for validating an Operations Centre is to be politically approved as formed from an official partner of the EGI infrastructure.

After this step, the Operations Centre needs to be technically validated. The Operations team is responsible for performing this validation.

Political Validation

CASE 1. If an Operations Centre is already represented within the EGI Council we recommend that the Operations Centre political representative within the EGI Council notifies the EGI Chief Operations Officer that the respective Operations Centre is entering its validation cycle. At this point, technical validation can start.

CASE 2. If an Operations Centre is not represented within the EGI Council, and it is willing to be represented there, the Operations Centre needs to submit a request for admission to the Council. After the Operations Centre has been accepted by the Council, CASE 1 applies.

CASE 3. If a new Operations Centre is not represented within the EGI Council and is not interested in being part of it, but would still like to be a consumer of the EGI Global Services, then an MoU must be established with EGI. Once an MoU is in place technical validation can start.

Technical Validation

  1. The Operations Centre Operations manager(s) sends an email to the Chief Operations Officer (COO) that the Council was informed about the creation of a new Operations Centre, and (if applicable) also that the Council has approved it. The Operations Centre Operations manager(s) should also indicate the IPC responsible for the validation in the email.
  1. The Chief Operations Officer opens a GGUS ticket to the IPC to start the validation process.

Start of the integration


An example/template for the Operations Centre creation ticket is provided here:

Subject: Creation of <Operations Centre name>


Required information for the creation of the Operations Centre:
Management mailing list : management@xxx.org
Operations Centre Operations manager contact data : Person Surname (email) +phone contact,
Deputy: Person Surname (email) +phone contact,

Operations Centre security officer contact data : Person Surname (email) +phone contact,
Operations Centre security mailing list : abuse@xxx.org

ROD team mailing list : ngi-support@xxx.org
Operations Centre nagios monitoring system details : https://mon-ngi.xxx.org/nagios
Mailing list for GGUS tickets if using GGUS directly : xxxticket@xxx.org

The FAQ document for the Operations Centre provided by the GGUS team as described in [[GGUS:FAQ_Responsible_Units]] 

Pre-requisites

Before opening an Operations Centre creation GGUS ticket, the Operations Centre should:

  1. Make sure your operational organization is able to fulfill RP OLA https://documents.egi.eu/secure/ShowDocument?docid=463
  2. Decide about the Operations Centre name. In case of national European Operations Centres, the name should start with "NGI_"
  3. Decide whether to use the Operations Centre's own help desk system or use GGUS directly. If the Operations Centre wants to set up their own system they need to provide an interface for interaction to GGUS with the local ticketing system and follow the recommendations available at https://ggus.eu/pages/ggus-docs/interfaces/docu_ggus_interfaces.php.
  4. Set the following contact points:
    1. Management mailing list
    2. Operations Centre operations manager contact data
    3. Operations Centre security officer contact data
    4. Operations Centre security mailing list
    5. ROD team mailing list (including people responsible for monitoring and supporting the Operations Centre infrastructure)
    6. Mailing list for GGUS tickets IF GGUS is used directly for the helpdesk system
  5. All certified Operations Centre sites need to be monitored by ARGO monitoring infrastructure operated centrally by EGI.
  6. Fill the FAQ document for the Operations Centre. The template is provided by the GGUS team: GGUS:FAQ_Responsible_Units
  7. Staff in the Operations Centre that should be granted a management role (manager, deputy manager, security officer) should first register a user account in the GOCDB. The user registration procedure is described in the GOCDB user documentation at https://twiki.cern.ch/twiki/bin/view/EMI/Glite-APELInstallation, section 4.1.1
  8. Staff in Operations Centre is familiar with Operational Procedures
  9. People who are responsible for operations should be subscribed to following mailing lists (unless differently specified):
    1. Operations Centre manager:
      noc-managers [at] mailman.egi.eu - registration through https://mailman.egi.eu/mailman/listinfo
    2. ROD team:
      All-operator-on-duty [at] mailman.egi.eu - list which integrate all Operations Centres ROD mailing list.
      Mailing list is populated automatically from GOCDB. New Operations Centres should make sure to record accurate information in GOCDB.
    3. Regional Helpdesk administrator:
      ggus-if-devs [at] cern.ch - mailing list designed for coordination of changes in the interface
      To register please send a request (through for example GGUS system) to GGUS support staff.
    4. Security staff:
      ngi-security-contacts [at] mailman.egi.eu, NGI or OC security officers subscribe to this mailing list
      site-security-contacts [at] mailam.egi.eu, Site (only certified site) security officers subscribe to this mailing list

      Both mailing lists are populated automatically from GOCDB. New Operations Centres should make sure to record accurate information in GOCDB.
    5. (Recommended) Site administrators:
      LCG-ROLLOUT [at] jiscmail.ac.uk - list gathers all site admins and is designed for technical discussions - membership not mandatory but useful
      Subscription is possible through https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=LCG-ROLLOUT&A=1

Steps

Some steps of the process can be done in parallel as they are independent, so all steps grouped within the same task number can be performed concurrently (several different child tickets will be created in order to speed up the process). The general idea is that these tickets must be closed before being able to move on to the next step.

Operations Centre creation.jpg
Creation procedure for new Operations Centre if the Operations Centre wasn't part of a ROC


Validation steps:

Step Substep Action on Action Required data Notes
1
IPC Verification of the validity of the request (were all needed data provided?)
To check is all needed information are provided to make the process quick
2
IPC Create child tickets in the order given as follows:


1 GOCDB Creation of a new Operations Centre entry in the (with no site attached).
  • Operations Centre name: <Operations Centre name>
  • Operations Centre management mailing list: foo@bar.org
  • Operations Centre security mailing list:
  • Operations Centre Operations manager:
  • Operations Centre security officer:
To register new  OC in official EGI database

2 Operations Portal Enter the new Operations Centre in the Operations dashboard (also add ROD mailing list) ROD email list To allow new OC to use operations dashboard which is required to perform grid oversight activity/span>

3 GGUS Create a new support unit in GGUS


OC should fill in https://wiki.egi.eu/wiki/GGUS:NGI_XXX_FAQ or https://wiki.egi.eu/wiki/GGUS:XXX_FAQ

Instruction how to create GGUS wiki page GGUS:FAQ Responsible Units

To make possible to ceate tickets in official EGI ticketing system to new OC

4 Operations Certification of new ROD team

Procedure_to_handle_new_ROD_certification_GGUS_tickets

Include in the GGUS ticket for :

  • Country
  • Operations Centre acronym
  • ROD email list
To verify if the future ROD team members are properly trained to perform their duties
3
The newly created Operations Centre

Confirmation that Operations Centre read, accept and agree to fulfill the RP OLA 

https://documents.egi.eu/secure/ShowDocument?docid=463



4
Dteam VO manager Create a branch/group in Dteam VO for Operations Centre and assign DN of people who will be dteam VO representative for the Operations Centre, Operations Centre Group owner and Operations Centre Group manager.

Responsibilities and terminology

VO representative: A person that can approve or deny dteam VO membership requests. This person is selected by the applicant during the registration phase.

Group owner: A person that can approve or deny Group membership requests. In addition he can create subgroups within his Group.

Group manager: A person that can approve or deny Group membership requests.

These responsibilities can be assigned to the same person(s).

How to assign the child ticket: assign the ticket to "VO support" after the selection of "dteam" in the concerned VO field

This step is not blocking the process and can be done in parallel. It is required to finish this step before closing parent ticket.

Following data about each responsible person:
  • Name and Surname
  • DN
  • email

To make possible to register operations staff from new OC to register in Dteam VO.

This VO gather all operations staff and allow them to access the grid.

5
The newly created Operations Centre Final confirmation that the new Operations Centre can start the operations
The confirmation means that OC will provide all services required in OLA
6
GOCDB
  • [If the Operations Centre was part of a OC]

GOCDB transfers related sites from the source ROC to the new Operations Centre structure.


The sites moving across to the new OC in the ticket indicated in the ticket.
The newly created Operations Centre
  • [If the Operations Centre wasn't part of a OC]

Newly created Operations Centre should insert at least one sites


Nagios system need at least one site to be validated.
7
The newly created Operations Centre

[If the Operations Centre was part of a OC] Transfer all open operational tickets to the new Operations Centre in GGUS.


To ensure that non of the GGUS tickets were forgotten durign the process.
8
The newly created Operations Centre Check that all the sites are visible in ARGO and that alarms show up in Operations portal

9
OPTIONAL The newly created Operations Centre All sites should configured GIIS according to the instructions at:

MAN1_How_to_publish_Site_Information

 GlueSiteOtherInfo: EGI_NGI=<Operations Centre name>
GlueSiteOtherInfo: GRID=EGI

This step can be performed at any time from this point.


To confirm that all sites publish proper data in information system about new NGI.
10

The newly created Operations Centre

[If the Operations Centre was part of a OC]

Inform managers of regional VOs to change the VO scope of their VOs to the relevant Operations Centre (national). This action require only confirmation from Operations Centre manager that information was passed.

Information which should be pass to VO managers: "The Vo scope can be changes by creating a ticket to Operations portal SU in GGUS."


To spread among Vo information
11
The newly created Operations Centre NGI_XX_SERVICES service group / XX_SERVICES service group

The newly created Operations Centre should create in GOC DB NGI_XX_SERVICES (if national entity) or XX_SERVICES (if international entity) service group and attached services listed on page: NGI services in GOCDB



12
The newly created Operations Centre Create a GGUS ticket to "Operations" SU with information about newly created service group and official OC Top-BDII, SAM, Argus instance. Ask to add OC manager to OC managers mailing list. 

 

13
IPC Final checks by the IPC.

Were all steps taken and finished properly? 

14
Operations

Final checks should be verified.

The information that the Operations Centre is ready should be sent in Monthly broadcast and announced during OMB by EGI Operations team.

(This broadcast should be sent to VO managers (except Ops and Dteam VO) and NOC/ROC managers)

See the template below for an indication of the message content.

Subject: <Operations Centre name> is operational

Dear All,

We would like to announce that <Operations Centre name> is now fully operational 
and that we have finished its integration procedure. All necessary operational 
teams and tools are established in our Operations Centre and we are ready for production. 
This Operations Centre is visible in all operational tools as <Operations Centre name> 
and is responsible for all <COUNTRY> sites.

Best regards,


Revision history

Version Authors Date Comments
2.11 Alessandro Paolini 2016-12-16 The monitoring is operated centrally, no more need of regional nagios servers. Procedure modified accordingly.
2.10 Alessandro Paolini 2016-06-08 made distinction between NGI (national entity) and Operations Centre (international entity, it could include also more NGIs). To check how to modify the step 14. To check how to modify the related wiki in step 16 (https://wiki.egi.eu/wiki/NGI_services_in_GOCDB).
2.09 Alessandro Paolini 2016-06-07 "EGI Operations Support" was decommissioned, changed all the references to "Operations"
2.08 Malgorzata Krakowian 2014-10-06 step 17 - providing information to Operations about NGI core services
2.07
Malgorzata Krakowian
2011-11-15
Reordered to have all Nagios related steps as close as possible. Added new column with explanation why the step should be taken.
2.06 Malgorzata Krakowian 2011-11-02 Step for RP OLA acceptance added; New point to prerequisites about RP OLA added
2.05 Malgorzata Krakowian 2011-09-28 Cleaning; GGUS require wiki page not a faq document etc.
2.04 Malgorzata Krakowian 2011-04-1 Step concerning SAMAP tool was removed due to tool decommission
2.03 Gonçalo Borges 2011-03-31 Operations Centre creation process visualization
2.02 T. Ferrari 2011-03-17 Assignment of procedure number, update of title "Operations Centre creation coordination procedure" to "Operations Centre Creation", small editorial improvements
2.02 Małgorzata Krakowian 2011-01-27 Name change from NGI to Operations Centre
2.01 Dimitris Zilaskos 2010-12-06 Clarification concerning dteam VO branch creation step
2.00 Małgorzata Krakowian, M. Radecki 2010-10-26 Approved by OMB
M. Krakowian 19 August 2014 Change contact group -> Operations support
Personal tools
Namespaces
Variants
Actions
Navigation
Toolbox
Print/export