The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other platforms; new updates will be ignored and lost.
If needed, you can get in touch with the EGI SDIS team using operations @ egi.eu.

{{Template:Op menubar}} {{Template:Doc_menubar}}{{Template:Under_construction}} {{TOC_right}}
 
[[Category:Deprecated]]
<br>
{| style="border:1px solid black; background-color:lightgrey; color: black; padding:5px; font-size:140%; width: 90%; margin: auto;"
 
| style="padding-right: 15px; padding-left: 15px;" |  
{{Ops_procedures
|[[File:Alert.png]] This page is '''Deprecated'''; the content has been moved to https://confluence.egi.eu/display/EGIPP/PROC19+Integration+of+new+cloud+management+framework+or+middleware+stack+in+the+EGI+Infrastructure
|Doc_title = Integration of new cloud stack and grid middleware in EGI Production Infrastructure
|Doc_link = [[PROC19|https://wiki.egi.eu/wiki/PROC19]]
|Version =  
|Policy_acronym = OMB
|Policy_name = Operations Management Board
|Contact_group = operations-support@mailman.egi.eu
|Doc_status = DRAFT
|Approval_date =
|Procedure_statement = This procedure describes the steps to integrate a new cloud stack (Cloud Platform) or grid middleware (Grid Platform) into the EGI Production Infrastructure.
}}
 
<br>
 
= Overview  =
 
To ensure the production quality of the EGI Infrastructure, every cloud stack (Cloud Platform) or middleware (Grid Platform) supported by production Resource Centres needs to fulfil certain requirements. The goal of this procedure is to ensure that the EGI Infrastructure remains compliant with those requirements.
 
= Definitions  =
 
*'''cloud stack''': software for creating, managing, and deploying infrastructure cloud services.
*'''grid middleware''': software that allows users to execute jobs in a grid infrastructure.
 
<br> Please refer to the [[Glossary|EGI Glossary]] for the definitions of the terms used in this procedure.<br>
 
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
 
= Entities involved in the procedure  =
 
*'''Technology Product team leader (TPL)''': the person representing and leading the Technology Product team
*'''EGI Operations''' '''(EGIOps)'''
*'''Operations Centre (OC)'''
*'''Resource Centre (RC)'''
*'''[[Operations Management Board|Operations Management Board]]''': EGI operations policy board
 
= Steps  =
 
== Sending a request  ==
 
The request can be sent by:
 
#Operations Centre
#EGI Operations
#Technology Product team leader
 
A Resource Centre can also request the integration of a new cloud stack or grid middleware. Such a request should first be approved by the Operations Centre it belongs to. In that case, the OC is responsible for creating a ticket on behalf of the RC. <br>
 
<br>
 
{| class="wikitable"
|-
! Step
! Action on
! Action
|-
| 1
| Applicant<br>
| Opens a [https://ggus.eu/ GGUS] ticket to Operations to start the process. <pre>Subject: Request for integration of XXX to EGI Production Infrastructure
 
Dear Operations,
 
We would like to request the start of the procedure for integrating XXX into the EGI Production Infrastructure.

Prerequisite data:
* name of Technology Product:
* customers of the Product (e.g. user community, Operations Centre):
* motivation:
 
Best Regards
XXX
</pre>
|-
| 2
| EGIOps
| EGI Operations contacts the OMB to ask for approval of the request.
|}
|}
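
The request ticket in step 1 follows a fixed template, so its text can be generated rather than typed by hand. The following is a minimal sketch only: the helper <code>build_request_ticket</code> and all example values are hypothetical, and it does not talk to GGUS. The ticket itself is still opened manually.

<pre>
def build_request_ticket(product, customers, motivation, sender):
    """Render the subject and body of the integration request ticket.

    Hypothetical helper: it only fills in the template from step 1 and
    does not submit anything to GGUS.
    """
    subject = f"Request for integration of {product} to EGI Production Infrastructure"
    lines = [
        "Dear Operations,",
        "",
        f"We would like to request the start of the procedure for integrating {product} "
        "into the EGI Production Infrastructure.",
        "",
        "Prerequisite data:",
        f"* name of Technology Product: {product}",
        f"* customers of the Product: {', '.join(customers)}",
        f"* motivation: {motivation}",
        "",
        "Best Regards",
        sender,
    ]
    return subject, "\n".join(lines)


# Placeholder values for illustration only.
subject, body = build_request_ticket(
    product="ExampleStack",
    customers=["example user community"],
    motivation="requested by an Operations Centre",
    sender="Example Applicant",
)
print(subject)
print(body)
</pre>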
== Request Validation  ==
{| class="wikitable"
|-
! Step
! Substep
! Action on
! Action
|-
| 1
| 1
| RC
|
As soon as the problem is detected, notify your NGI operations centre by opening a [http://helpdesk.egi.eu/ GGUS ticket].
Please address the ticket to your Operations Centre support unit, which is responsible for validating the request. In the GGUS ticket you must mention:
#the starting and ending time of the problem (including day and hour in UTC)
#the Site, NGI/federation of NGIs affected by the problem
#the VO affected by the problem (must be the OPS VO)
#a description of the problem
|-
|
| 2
| TPL<br>
| The NGI operations centre validates the request.
|-
|
| 3
| OC
|
If the request is deemed valid, a GGUS ticket is sent to the [[GGUS:SLM-FAQ|Service Level Management]] (SLM) Support Unit.
The SLM support team will take care of discussing all requests received with the SAM team.
|-
| 4
|
| SLM SU
|
#validate the reported problems
#discuss the reported problems with the SAM Support Unit if needed
#notify the SAM SU about the requests received by submitting a new parent ticket to SAM, with the validated requests as its child tickets
|-
| 5<br>
|
| SLM SU
|
#The SAM Support Unit is responsible for checking the requests and regenerating the results. For accepted requests, all Nagios metric results for any site and service are set to ''unknown'' status from the beginning of the hour reported in the starting time until one hour after the ending time, so that results arriving late are also covered. The availability and reliability of other sites are not affected, as unknown periods are not considered in the computation.
#New monthly availability statistics will be recomputed for that particular period, Site, NGI/federation of NGIs.
#A new report will be made available 10 days after the first publication of the report.
#After publication of the new report, all child GGUS tickets will be closed.
|-
| 6
|
| SLM SU
| Close the parent ticket.
|}
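
The ''unknown'' window described in step 5 runs from the beginning of the hour of the reported starting time to one hour after the reported ending time. The following is a minimal sketch of that arithmetic, not the SAM implementation: the function name <code>unknown_window</code> and the example timestamps are assumptions made for illustration, and times are taken to be in UTC as required in the ticket.

<pre>
from datetime import datetime, timedelta, timezone


def unknown_window(problem_start, problem_end):
    """Return (window_start, window_end) for the 'unknown' period.

    Hypothetical helper illustrating the rule in step 5:
    - the window starts at the beginning of the hour of problem_start
    - the window ends one hour after problem_end
    """
    if problem_start > problem_end:
        raise ValueError("problem_start must not be later than problem_end")
    window_start = problem_start.replace(minute=0, second=0, microsecond=0)
    window_end = problem_end + timedelta(hours=1)
    return window_start, window_end


# Illustrative timestamps only (UTC).
reported_start = datetime(2022, 3, 14, 9, 25, tzinfo=timezone.utc)
reported_end = datetime(2022, 3, 14, 11, 40, tzinfo=timezone.utc)
window = unknown_window(reported_start, reported_end)
print(window[0].isoformat(), window[1].isoformat())
</pre>

With these example values the window starts at 09:00 UTC and ends at 12:40 UTC on the same day.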
<br>
<br>
<br>
== Integration steps  ==
Integration covers the following areas:
#'''Management '''- integration with Grid Configuration Database [GOCDB] ([http://goc.egi.eu/ http://goc.egi.eu/])<br>
#*e.g. creation of a service type<br>
#'''Monitoring '''- integration with Service Availability Monitoring [SAM] <br>
#*development of monitoring probes <br>
#*integration of probes into SAM release<br>
#*integration of probes with Ava/Rel profile<br>
#'''Accounting '''- integration with EGI Accounting Infrastructure<br>
#*enabling all the accounting data to be collected in one place for a unified view<br>
#'''Support '''- integration with EGI Helpdesk [GGUS] ([http://ggus.eu http://ggus.eu])<br>
#*creation of support unit and using it to provide support<br>
#'''Dashboard '''- integration with Operations Portal ([http://operations-portal.egi.eu/ http://operations-portal.egi.eu/])
#*integration of probes into Operations Tests pool
#'''Documentation '''- integration with existing operations procedures <br>
#*integration with Resource Centre OLA
#*integration with Resource Centre registration and certification procedure
#**creation of a how-to for manually testing the middleware
#'''Information System''' - integration with EGI Information system
{| class="wikitable"
|-
! #
! Responsible
! Action
|- valign="top"
| 0
| RC
|
'''Contact your Resource Infrastructure Operations Manager''' (contact information is available at [http://www.egi.eu/community/resource-providers/ EGI web site]).
*Provide the required information according to the template available in the [[HOWTO01|Required information]] page.
|- valign="top"
| 1
| RP
|
'''Accept or reject registration request''' and communicate this result back to applicant.
*If the Resource Centre is accepted, notify the relevant Operations Centre, handle the Resource Centre information received, and put the Operations Centre in contact with the Resource Centre Operations Manager.
|- valign="top"
| 2
| OC
|
#'''Forward all documentation''':
#*[[HOWTO02|documentation that must be read and accepted]]
#*documentation on how to install and configure the Resource Centre services
#Clarify any doubts or questions.
Include the Operations Centre ROD, CSIRT, or help-desk teams in this step if necessary.
|- valign="top"
| 3
| OC
|
#'''Add the Resource Centre to the [https://goc.egi.eu/ GOCDB]''' and flag it as "Candidate".
#*Only Regional Management level users (D') can add a site to the NGI and can update the certification status of the site, see [[GOCDB/Input System User Documentation#Roles]]
#Notify the Resource Centre Operations Manager that he/she should [[EGI Operations Start Guide#Joining_operations|Join operations]]
#*In the [https://goc.egi.eu/ GOCDB] he/she should request the 'Resource Centre Operations Manager' role. Approve it when done.
#Notify the Resource Centre Operations Manager that the person responsible for security should [[EGI Operations Start Guide#Joining_operations|Join operations]]
#*In the [https://goc.egi.eu/ GOCDB] he/she should request the 'Resource Centre Security Officer' role. Approve it when done.
|- valign="top"
| 4
| RC
|
#'''Complete any missing information for the Resource Centre's entry in the GOCDB'''
#*This includes the services that are to be integrated into the infrastructure. See the [[Fedcloud-tf:WorkGroups:Scenario5#GOCDB|instructions]]
#Notify the Operations Centre that the Resource Centre information update is concluded.
#Note: If the external RC is considering buying host certificates, please make sure they are obtained from an EGI-recognised authority. [http://www.eugridpma.org/members/worldmap/ The local national CA] (usually free or at a flat rate) is likely the best source of SSL certificates.
|- valign="top"
| 5
| OC
|
'''Check [http://goc.egi.eu/ GOC DB]''' that the Resource Centre's information is correct.
*Resource Centre (site) roles and any other additional information.
*Check that contacts receive email (if they are mailing lists, check that outside EGI members are allowed to post there). Site administrator MUST reply to the test email.<br>
*Check that the required services for a Resource Centre are properly registered.<br>
*Check domain names and forward and reverse DNS.
|- valign="top"
| 6
| OC
|
'''Any other Operations Centre-specific requirements''' (e.g. join a certain VO and/or mailing list, etc.)
|- valign="top"
| 7
| OC
|
If all previous actions have been completed with success, notify the Resource Centre Operations Manager that the Registration is completed, and contact the Resource Infrastructure Operations Manager to notify that a new candidate Resource Centre exists and is ready to be certified.
|}
After the successful completion of all these steps, the registration phase is completed and the Resource Centre is ready for the start of the certification phase.
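
The forward and reverse DNS check in step 5 can be partly automated. The following is a minimal sketch using only the Python standard library: the function name <code>dns_round_trip_ok</code> and the <code>forward</code>/<code>reverse</code> parameters are assumptions introduced here for illustration (they make the check testable without network access) and are not part of any EGI tool.

<pre>
import socket


def dns_round_trip_ok(hostname,
                      forward=socket.gethostbyname,
                      reverse=socket.gethostbyaddr):
    """Return True if hostname resolves to an address that maps back to it.

    Hypothetical helper: 'forward' maps a name to an IPv4 address and
    'reverse' maps an address to (name, aliases, addresses), as the
    standard socket functions do. Any lookup failure counts as a failed check.
    """
    try:
        address = forward(hostname)
        reverse_name, aliases, _addresses = reverse(address)
    except OSError:  # socket.gaierror and socket.herror are OSError subclasses
        return False
    wanted = hostname.lower().rstrip(".")
    found = {reverse_name.lower().rstrip(".")}
    found.update(alias.lower().rstrip(".") for alias in aliases)
    return wanted in found
</pre>

Calling <code>dns_round_trip_ok("host.example.org")</code> with the default arguments performs real lookups. The host name is a placeholder, and the sketch covers IPv4 only.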
= Revision History  =
{| class="wikitable"
|-
! Version
! Authors
! Date
! Comments
|-
| <br>
|
|
| <br>
|}
[[Category:Operations_Procedures]]

Latest revision as of 15:19, 23 August 2022