Difference between revisions of "NGI DE:GOP/read"
(Created page with "{{Ngi-de-header-gop|GOP for reading}} = Draft 2013 = {{:NGI DE:GOP-2013/Content}} {{:NGI DE:GOP-2013/Authors}} {{:NGI DE:GOP-2013/Abstract}} {{:NGI DE:GOP-2013/Scope}} {{:NGI D...") |
(No difference)
|
Revision as of 09:03, 25 October 2012
Draft - not an approved document
GOP for reading
Draft 2013
Authors
- Wilhelm Bühler, Karlsruhe Institute of Technology (corresponding author)
- Torsten Antoni, Karlsruhe Institute of Technology
- Richard Grunzke, Technische Universität Dresden
- Dimitri Nilsen, Karlsruhe Institute of Technology
- Achim Streit, Karlsruhe Institute of Technology
- Pavel Weber, Karlsruhe Institute of Technology
- Mathilde Romberg, Forschungszentrum Jülich GmbH
Abstract
The mission of NGI-DE is to provide the reliable access to and the collaborative use of federated IT resources from science communities for science in Germany and worldwide.
To ensure a sustainable and seamless operation of the existing e-infrastructure a common understanding of policies and procedures on a national level is needed.
The NGI-DE General Operations Policy (NGI-DE GOP) is based on the EGI procedures and policies, the “D-Grid-Betriebskonzept” (German Grid operations concept [DGI-2-BK2012]) and German law.
The idea of the GOP is to provide a framework, where European and national procedures could be integrated.
Scope
Grid Computing in Germany started in 1997. Projects like D-Grid, EGEE and EGI-InSPIRE enabled the development of a heterogeneous and productive e-infrastructure. To ensure a sustainable and seamless operation of the existing e-Infrastructure a common understanding of policies and procedures is needed.
The GOP is a high level document without technical details. It is a permanent agreement signed by all partners. The GOP will be coordinated by the Grid Operations Centre (NGI-DE GOC) and include scope, definitions and the general policies.
Details will be put in annexes and appendixes.
Definitions
Working Language
The working language of all technical documentation is English to ensure an easy exchange of documents with Grid initiatives in Europe and beyond. Documents released before 1st of January 2013 can be in German language.
Resource Infrastructure Provider
A Resource Infrastructure Provider (RIP-DE) provides one or more basic Grid infrastructure service(s).
Resource Centre
A Resource Centre (RC-DE) operates resources.
There are three roles of persons:
- Operators (Resource Centre Administrator) are member of NGI-DE-OPERATIONS.
- two mailing lists for announcements and discussions
- monthly phone conferences
- Operation Workshops
- technical discussions, preparation of technical documentation,...
- Managers (Resource Centre Operations Manager) are part of the Resource Centre Representation (RCR-DE)
- Political discussions, escalations, ...
- one mailing lists for both: announcements and discussions
- Not yet established
- Accept Annex NGI-DE requirements on resource centres
- Security Contact
- Site Security Officer
A Resource Centre offers one or more service to users called Service Endpoints.
Scientific Grid Users
A Scientific Grid Users (User-DE) uses resources.
- Representation in the advisory board “NGI-DE-Beirat”
- Accept Annex NGI-DE requirements on users
Grid Operations- and Support Centre
The Grid Operations- and Support Centre (NGI-DE-GOSC) is responsible for Operations and Support.
This includes:
- Operations
- Regional Operator on Duty (ROD)
- Monitoring
- Support
- Firstline Support
- Ticketrouting, Ticket process manager (TPM)
- Documentation
- Coordination
- GOP
- Support Units
General Policies
Resource Centre Representation
The Resource Centre Representation (RCR-DE) is the representation of the Resource Centre Operations Managers of all sites registered with NGI-DE.
not yet established
Operation Policies
NGI-DE is part of EGI and the Operational Level Agreements of EGI are part of the GOP.
- Resource Centre (RC) Operational Level Agreement (release notes)
- Resource infrastructure Provider (RP) Operational Level Agreement (release notes)
Operation Procedures
The purpose of a procedure is to define the related workflow. They are periodically reviewed.
NGI-DE is part of EGI. The following EGI procedures approved by the OMB are part of the GOP.
EGI Operational Procedures are prescriptive documents that describe step-by-step processes involving several partners.
Number | Title | Comment | Area | Relevant to |
---|---|---|---|---|
EGI-PROC 01 | Grid Oversight Escalation Procedure | Operations ticket escalation | Ticket Management | Resource Centre Administrators, Operations Centres, COD |
EGI-PROC 02 | Operations Centre Creation | Step-by-step instructions on how to create a new Operations Centre | Operations Centre Management | Operations Centres, COD |
EGI-PROC 03 | Operations Centre decommissioning | Step-by-step instructions on how to decommission an Operations Centre | Operations Centre Management | Operations Centres, COD |
EGI-PROC 04 | Quality verification of monthly availability and reliability statistcs | Instructions RODs and Operations Centres on how to handle justification for poor monthly performance through GGUS | Availability and Monitoring | Resource Centre Administrators, Operations Centres, COD |
EGI-PROC 05 | Validation of a Operations Centre Nagios | This procedure is part of the Operations Centre creation procedure. | Availability and Monitoring | Operations Centres, COD |
EGI-PROC 06 | Setting a Nagios test status to OPERATIONS | A Nagios probe is set to OPERATIONS when its results are used to generate notifications for the Operations Dashboard. This procedure details the steps to turn a Nagios test to OPERATIONs. | Availability and Monitoring | Operations Centres, COD |
EGI-PROC 07 | Adding new probes to SAM | Addition of new OPS Nagios probes to the SAM release. | Availability and Monitoring | Resource Centre Administrators, Operations Centres, COD |
EGI-PROC 08 | Management of the EGI OPS Availability and Reliability Profile | Request of a OPS EGI Availability and Reliability profile. A change in the profile is needed every time a new Nagios test needs to be added/removed to/from the profile, in order to have its results included/removed in/from Availability and Reliability monthly statistics. | Availability and Monitoring | Resource Centre Administrators, Operations Centres, COD |
EGI-PROC 09 | Resource Centre Registration and Certification Procedure | Registration of a new Resource Centre in the GOCDB | Resource Centre Management | Resource Centre Administrator, Operations Centres |
EGI-PROC 10 | Recomputation of monitoring results and availability statistics | Notification of problems with the monitoring results gathered by SAM and to request a recomputation of results and the related availability and reliability statistics | Availability and Monitoring | Resource Centre Administrators, Operations Centres |
EGI-PROC 11 | Resource Centre Decommissioning Procedure | Decommissioning of a Resource Centre before it is turned into CLOSED in GOCDB | Resource Centre Management | Resource Centre Administrator, Operations Centres |
EGI-PROC 12 | Production Service Decommissioning Procedure | Decommissioning of a EGI production service | Resource Centre Management | Resource Centre Administrator, Operations Centres |
EGI-PROC 13 | VO Deregistration Procedure | Decommissioning of a Virtual Organization supported by the European Grid Infrastructure | VO Management | VO Managers, Operations Manager |
EGI-PROC 14 | VO Registration Procedure | Registration of a Virtual Organization to the European Grid Infrastructure | VO Management | VO Managers, Operations Manager |
EGI-PROC 15 | Resource Center renaming Procedure | A procedure for directly renaming a Resource Center. | Resource Centre Management | Resource Centre Administrator, Operations Centres |
EGI Policies and Procedures
This policy is about how to integrate EGI documents.
- The Operations Management Board (OMB) is a EGI policy board.
- The NGI-DE representative in the OMB is responsible for the communication about changes on EGI documents.
List of Annexes
To be short technical details are defined in Annexes.
Annex “NGI-DE infrastructure services”
NGI-DE provides the reliable access to and the collaborative use of federated IT resources from science communities for science in Germany and worldwide.
- help desk
- monitoring facilities
- central services for resource centres
- central services for users
- security services and support
- support for users and administrators
- information services
Infrastructure services are defined in annex “NGI-DE infrastructure services” and agreed by the representations of users and resource centres.
Annex “NGI-DE security and privacy”
German version of EGI security and privacy procedures, policies and processes.
Annex “NGI-DE requirements on users”
- Acceptable Use Policies
- Referencing NGI-DE in scientific papers
- Reporting scientific work (to RIP-DE, available for all RC-DE)
Requirements on users are defined in annex “NGI-DE user requirements” and agreed by the representations of users and resource centres.
Annex “NGI-DE requirements on resource centres”
- Procedures about “NGI-DE resource centre certification”
- SLA
- offering (Software, static informations about maximum walltime, number of nodes per queue, minimal/maximum job size, fairshare values)
- Relation to RCR-DE
Resource centre requirements are defined in annex “NGI-DE resource centre requirements” and agreed by the representation of resource centres RCR-DE.
Annex “NGI-DE other procedures”
Other procedures can be defined in annex “NGI-DE other procedures” and agreed by the representation of resource centres RCR-DE. Other procedures may be used for documents to integrate into or disintegrate parts out of the NGI-DE infrastructure.
- VO-Migration Service from D-Grid to NGI-DE/EGI
- Site-Migration from D-Gridto NGI-DE/EGI
- Final documentation of VO work in de-registration process of projects.
References
- DGI-2-BK2012
- D-Grid operating concept (Draft in German) “Betriebskonzept für die D-Grid Infrastruktur” (Version 2.1, 2012-02-13)
- http://www.d-grid.de/fileadmin/user_upload/documents/Kern-D-Grid/Betriebskonzept/D-Grid-Betriebskonzept-V2.1.pdf
- EGI-GLOSSARY1
- Glossary of EGI, online: Glossary V1
Annexes
Annex “NGI-DE infrastructure services”
Appendixes:
- help desk / first line support
- “NGI-DE help desk” https://helpdesk.ngi-de.eu/
- monitoring facilities
- D-MON, Inca, Nagios
- central services for resource centres
- ...
- central services for users
- myProxy
- VOMS
- central UI
- central services for users and resource centres
- BDII
- gLite central Services (WMS+ LFC)
- UNICORE 6 central services, Support
- “NGI-DE accounting” (DGAS + HLRmon)
- dCache
- GT5 central services
- Gatlet
- security services and support
- CA, AAI, CERT
- support for users and administrators
- UNICORE 6 Support
- dCache
- GT5 Support
- Gatlet
Annex “NGI-DE security and privacy”
Appendixes:
- German version of EGIs “Incident Response Procedure” (from DGI-2: http://dgi-2.d-grid.de/downloads/DGI-2-security-policy-1.0.pdf)
- German version of EGIs “The Software Vulnerability Issue Handling Process” (from DGI-2: http://dgi-2.d-grid.de/downloads/dgi2-software-vulnerability-policy-m12.pdf)
- German version of EGIs “Operational Security Procedures” (from DGI-2: http://dgi-2.d-grid.de/downloads/DGI-2-operational-security-procedures-1.0.pdf)
- German version of EGIs “Grid Site Operations Policy” (from DGI-2)
Annex “NGI-DE requirements on users”
Appendixes:
- NGI-DE Acceptable Use Policy
- EGI Acceptable Use Policy
- Referencing NGI-DE in scientific papers
- ...
Annex “NGI-DE infrastructure services”
Appendixes:
- “NGI-DE resource centre certification for gLite”
- “NGI DE:GOP/resource centre certification for UNICORE”
- “NGI DE:GOP/resource centre certification for Globus”
- “NGI-DE resource centre certification for dCache”
Annex “NGI-DE other procedures”
Appendixes:
- “D-Grid procedure about end of D-Grid-VO” http://www.d-grid.de/fileadmin/user_upload/documents/Kern-D-Grid/Betriebskonzept/D-Grid-Betriebskonzept-VO-Ausscheiden-1.0.pdf
- VO-Migration Service from D-Grid to NGI-DE/EGI
- if needed
- Site Migration from D-Grid to NGI-DE/EGI
- if needed
- Final documentation of VO work in de-registration process of projects