NGI DE:GOP/read

Draft - not an approved document


GOP for reading


Draft 2013

Authors

  • Wilhelm Bühler, Karlsruhe Institute of Technology (corresponding author)
  • Torsten Antoni, Karlsruhe Institute of Technology
  • Richard Grunzke, Technische Universität Dresden
  • Dimitri Nilsen, Karlsruhe Institute of Technology
  • Achim Streit, Karlsruhe Institute of Technology
  • Pavel Weber, Karlsruhe Institute of Technology
  • Mathilde Romberg, Forschungszentrum Jülich GmbH


Abstract

The mission of NGI-DE is to provide reliable access to, and enable the collaborative use of, federated IT resources from science communities for science in Germany and worldwide.

To ensure sustainable and seamless operation of the existing e-infrastructure, a common understanding of policies and procedures at the national level is needed.

The NGI-DE General Operations Policy (NGI-DE GOP) is based on the EGI procedures and policies, the “D-Grid-Betriebskonzept” (German Grid operations concept [DGI-2-BK2012]) and German law.

The GOP aims to provide a framework into which European and national procedures can be integrated.


Scope

Grid computing in Germany started in 1997. Projects such as D-Grid, EGEE and EGI-InSPIRE enabled the development of a heterogeneous and productive e-infrastructure. To ensure sustainable and seamless operation of this e-infrastructure, a common understanding of policies and procedures is needed.

The GOP is a high-level document without technical details. It is a permanent agreement signed by all partners. The GOP is coordinated by the Grid Operations Centre (NGI-DE GOC) and includes the scope, definitions and general policies.

Details are given in annexes and appendixes.


Definitions

Working Language

The working language of all technical documentation is English, to ensure an easy exchange of documents with Grid initiatives in Europe and beyond. Documents released before 1 January 2013 may be in German.


Resource Infrastructure Provider

A Resource Infrastructure Provider (RIP-DE) provides one or more basic Grid infrastructure service(s).


Resource Centre

A Resource Centre (RC-DE) operates resources.

Three roles are associated with a Resource Centre:

  1. Operators (Resource Centre Administrators) are members of NGI-DE-OPERATIONS.
    • Two mailing lists for announcements and discussions
    • Monthly phone conferences
    • Operations workshops
    • Technical discussions, preparation of technical documentation, ...
  2. Managers (Resource Centre Operations Managers) are part of the Resource Centre Representation (RCR-DE).
    • Political discussions, escalations, ...
    • One mailing list for both announcements and discussions
    • Not yet established
    • Accept the annex “NGI-DE requirements on resource centres”
  3. Security Contact
    • Site Security Officer

A Resource Centre offers users one or more services, called Service Endpoints.


Scientific Grid Users

A Scientific Grid User (User-DE) uses resources.

  • Representation in the advisory board “NGI-DE-Beirat”
  • Accept the annex “NGI-DE requirements on users”


Grid Operations and Support Centre

The Grid Operations and Support Centre (NGI-DE-GOSC) is responsible for operations and support.

This includes:

  • Operations
    • Regional Operator on Duty (ROD)
    • Monitoring
  • Support
    • First-line support
    • Ticket routing, Ticket Process Manager (TPM)
  • Documentation
  • Coordination
    • GOP
    • Support Units


General Policies

Resource Centre Representation

The Resource Centre Representation (RCR-DE) is the representation of the Resource Centre Operations Managers of all sites registered with NGI-DE.

The RCR-DE is not yet established.


Operation Policies

NGI-DE is part of EGI and the Operational Level Agreements of EGI are part of the GOP.


Operation Procedures

A procedure defines the workflow for a given task. Procedures are reviewed periodically.

NGI-DE is part of EGI. The following EGI procedures, approved by the OMB, are part of the GOP.

EGI Operational Procedures are prescriptive documents that describe step-by-step processes involving several partners.

| Number | Title | Comment | Area | Relevant to |
|---|---|---|---|---|
| EGI-PROC 01 | Grid Oversight Escalation Procedure | Operations ticket escalation | Ticket Management | Resource Centre Administrators, Operations Centres, COD |
| EGI-PROC 02 | Operations Centre Creation | Step-by-step instructions on how to create a new Operations Centre | Operations Centre Management | Operations Centres, COD |
| EGI-PROC 03 | Operations Centre decommissioning | Step-by-step instructions on how to decommission an Operations Centre | Operations Centre Management | Operations Centres, COD |
| EGI-PROC 04 | Quality verification of monthly availability and reliability statistics | Instructions for RODs and Operations Centres on how to handle justifications for poor monthly performance through GGUS | Availability and Monitoring | Resource Centre Administrators, Operations Centres, COD |
| EGI-PROC 05 | Validation of an Operations Centre Nagios | This procedure is part of the Operations Centre creation procedure. | Availability and Monitoring | Operations Centres, COD |
| EGI-PROC 06 | Setting a Nagios test status to OPERATIONS | A Nagios probe is set to OPERATIONS when its results are used to generate notifications for the Operations Dashboard. This procedure details the steps to set a Nagios test to OPERATIONS. | Availability and Monitoring | Operations Centres, COD |
| EGI-PROC 07 | Adding new probes to SAM | Addition of new OPS Nagios probes to the SAM release. | Availability and Monitoring | Resource Centre Administrators, Operations Centres, COD |
| EGI-PROC 08 | Management of the EGI OPS Availability and Reliability Profile | Request for a change to the EGI OPS Availability and Reliability profile. A change in the profile is needed every time a Nagios test is to be added to or removed from the profile, so that its results are included in or excluded from the monthly Availability and Reliability statistics. | Availability and Monitoring | Resource Centre Administrators, Operations Centres, COD |
| EGI-PROC 09 | Resource Centre Registration and Certification Procedure | Registration of a new Resource Centre in the GOCDB | Resource Centre Management | Resource Centre Administrator, Operations Centres |
| EGI-PROC 10 | Recomputation of monitoring results and availability statistics | Notification of problems with the monitoring results gathered by SAM, and request for a recomputation of the results and of the related availability and reliability statistics | Availability and Monitoring | Resource Centre Administrators, Operations Centres |
| EGI-PROC 11 | Resource Centre Decommissioning Procedure | Decommissioning of a Resource Centre before its status is set to CLOSED in GOCDB | Resource Centre Management | Resource Centre Administrator, Operations Centres |
| EGI-PROC 12 | Production Service Decommissioning Procedure | Decommissioning of an EGI production service | Resource Centre Management | Resource Centre Administrator, Operations Centres |
| EGI-PROC 13 | VO Deregistration Procedure | Decommissioning of a Virtual Organization supported by the European Grid Infrastructure | VO Management | VO Managers, Operations Manager |
| EGI-PROC 14 | VO Registration Procedure | Registration of a Virtual Organization to the European Grid Infrastructure | VO Management | VO Managers, Operations Manager |
| EGI-PROC 15 | Resource Center renaming Procedure | A procedure for directly renaming a Resource Center. | Resource Centre Management | Resource Centre Administrator, Operations Centres |


EGI Policies and Procedures

This policy describes how EGI documents are integrated into the GOP.

  • The Operations Management Board (OMB) is an EGI policy board.
  • The NGI-DE representative in the OMB is responsible for communicating changes to EGI documents.


List of Annexes

To keep the GOP short, technical details are defined in annexes.

Annex “NGI-DE infrastructure services”

NGI-DE provides reliable access to, and enables the collaborative use of, federated IT resources from science communities for science in Germany and worldwide. Its infrastructure services include:

  • help desk
  • monitoring facilities
  • central services for resource centres
  • central services for users
  • security services and support
  • support for users and administrators
  • information services

Infrastructure services are defined in annex “NGI-DE infrastructure services” and agreed by the representations of users and resource centres.

Annex “NGI-DE security and privacy”

This annex contains the German version of the EGI security and privacy procedures, policies and processes.

Annex “NGI-DE requirements on users”

  • Acceptable Use Policies
  • Referencing NGI-DE in scientific papers
  • Reporting scientific work (to RIP-DE, available for all RC-DE)

Requirements on users are defined in annex “NGI-DE requirements on users” and agreed by the representations of users and resource centres.

Annex “NGI-DE requirements on resource centres”

  • Procedures about “NGI-DE resource centre certification”
  • SLA
  • Offering (software; static information on maximum walltime, number of nodes per queue, minimum/maximum job size, fairshare values)
  • Relation to RCR-DE

Resource centre requirements are defined in annex “NGI-DE requirements on resource centres” and agreed by the representation of resource centres (RCR-DE).

Annex “NGI-DE other procedures”

Other procedures can be defined in annex “NGI-DE other procedures” and agreed by the representation of resource centres (RCR-DE). Such procedures may document how parts are integrated into, or removed from, the NGI-DE infrastructure.

  • VO migration service from D-Grid to NGI-DE/EGI
  • Site migration from D-Grid to NGI-DE/EGI
  • Final documentation of VO work in the de-registration process of projects



References

DGI-2-BK2012 
D-Grid operating concept (Draft in German) “Betriebskonzept für die D-Grid Infrastruktur” (Version 2.1, 2012-02-13)
http://www.d-grid.de/fileadmin/user_upload/documents/Kern-D-Grid/Betriebskonzept/D-Grid-Betriebskonzept-V2.1.pdf
EGI-GLOSSARY1 
Glossary of EGI, online: Glossary V1

Annexes

Annex “NGI-DE infrastructure services”

Appendixes:

  • help desk / first line support
  • monitoring facilities
    • D-MON, Inca, Nagios
  • central services for resource centres
    • ...
  • central services for users
    • myProxy
    • VOMS
    • central UI
  • central services for users and resource centres
  • security services and support
    • CA, AAI, CERT
  • support for users and administrators
    • UNICORE 6 Support
    • dCache
    • GT5 Support
    • Gatlet


Annex “NGI-DE security and privacy”

Appendixes:


Annex “NGI-DE requirements on users”

Appendixes:

  • NGI-DE Acceptable Use Policy
  • EGI Acceptable Use Policy
  • Referencing NGI-DE in scientific papers
  • ...


Annex “NGI-DE requirements on resource centres”

Appendixes:


Annex “NGI-DE other procedures”

Appendixes: