Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "EGI-InSPIRE:Sa1 tasks"

From EGIWiki
Jump to navigation Jump to search
 
(18 intermediate revisions by 5 users not shown)
Line 1: Line 1:
{{Template:Op menubar}}
{{EGI-Inspire_menubar}}
{{TOC_right}}
{{TOC_right}}  
 
This page explains how operational activities map to EGI-InSPIRE SA1 tasks and provides guidance to SA1 staff on how to report activities and effort.
This page explains how operational activities map to EGI-InSPIRE SA1 tasks and provides guidance to SA1 staff on how to report activities and effort.
For a complete overview of EGI InSPIRE activities, please refer to the [https://documents.egi.eu/document/10 EGI-InSPIRE Description of Work].
For a complete overview of EGI InSPIRE activities, please refer to the [https://documents.egi.eu/document/10 EGI-InSPIRE Description of Work].
Line 10: Line 11:
* (NGI and EGI.eu) Review of SA1 and non-SA1 milestones and deliverables
* (NGI and EGI.eu) Review of SA1 and non-SA1 milestones and deliverables


= Task 2: A Secure Infrastructure =  
= Task 2: A [[Security|Secure Infrastructure]] =  
(Task leader: M. Ma - STFC)
(Task leader: D. Kelsey - STFC)


* (NGI) participation to [https://wiki.egi.eu/wiki/Csirt EGI CSIRT]. EGI CSIRT covers all aspects of operational security aimed at achieving a secure infrastructure within EGI. It ensures both the coordination with peer grids and with the NGIs and NREN CSIRTs.
* (NGI) participation to [[Csirt |EGI CSIRT]]. EGI CSIRT covers all aspects of operational security aimed at achieving a secure infrastructure within EGI. It ensures both the coordination with peer grids and with the NGIs and NREN CSIRTs.
** Incident Response Task Force (IRTF) Handle day to day operational security issues, coordinates Computer-Security-Incident-Response
** Incident Response Task Force (IRTF) Handle day to day operational security issues, coordinates Computer-Security-Incident-Response
** Security Drills Group (SDG) testing of the inter CSIRT communication channels and assessing the Site-Security-Teams readiness to handle IT-security incidents.
** Security Drills Group (SDG) testing of the inter CSIRT communication channels and assessing the Site-Security-Teams readiness to handle IT-security incidents.
Line 19: Line 20:
** Training and Dissemination Group (TDG) Raise security awareness and improve security for system administrators by providing security training and best practice
** Training and Dissemination Group (TDG) Raise security awareness and improve security for system administrators by providing security training and best practice


* (NGI) participation to the EGI Software Vulnerability Group ([https://wiki.egi.eu/wiki/SVG SVG]). The purpose of the EGI Software Vulnerability Group is to eliminate existing vulnerabilities from the deployed infrastructure, primarily from the grid middleware, prevent the introduction of new ones and prevent security incidents.
* (NGI) participation to the EGI Software Vulnerability Group ([[SVG |SVG]]). The purpose of the EGI Software Vulnerability Group is to eliminate existing vulnerabilities from the deployed infrastructure, primarily from the grid middleware, prevent the introduction of new ones and prevent security incidents.


* (EGI Global task) Coordination of [http://www.eugridpma.org/ EUGridPMA] - D. Groep, Nikhef
* (EGI Global task) Coordination of [http://www.eugridpma.org/ EUGridPMA] - D. Groep, Nikhef
* (EGI Global task) Coordination of operational security - M. Ma, STFC
* (EGI Global task) Coordination of operational security - M. Ma, STFC


= Task 3: Service Deployment and Validation =  
= Task 3: [[Staged-Rollout|Software Deployment ]] and [[Interoperations]] =
(Task leader: M. David - LIP, Dep. M. Lechner - KTH)
 
* (NGI) Participation to staged rollout of middleware component updates
(Task leader: J. Pina - LIP, M. Krakowian - EGI.eu)  
* (NGI) Interoperability at the NGI level: operation of multiple middleware stacks and novel resources (virtualization, HPC, etc.), interoperability with regional grids, feeding interoperability requirements into EGI
 
*(NGI) Participation to staged rollout of middleware component updates  
*(NGI) Interoperability at the NGI level: operation of multiple middleware stacks and novel resources (virtualization, HPC, etc.), interoperability with regional grids, feeding interoperability requirements into EGI


* (EGI global task) Definition and implementation of a new workflow to automate software deployment cycle, from release of the sw developers to deployment
*(EGI global task) Definition and implementation of a new workflow to automate software deployment cycle, from release of the sw developers to deployment  
* (EGI global task) Coordination of the staged rollout activities carried out by the NGIs
*(EGI global task) Coordination of the staged rollout activities carried out by the NGIs  
* (EGI global task) Liaison with gLite Collaboration release team (interim)
*(EGI global task) Liaison with gLite Collaboration release team (interim)  
* (EGI global task) Chairing the operations meetings
*(EGI global task) Chairing the operations meetings  
* (EGI global task) Operational interoperation between NGIs and with international Grids, e.g. of the accounting and monitoring infrastructure, gathering or requirements  
*(EGI global task) Operational interoperation between NGIs and with international Grids, e.g. of the accounting and monitoring infrastructure, gathering or requirements


= Task 4: Infrastructure for Grid Management =  
= Task 4: Infrastructure for Grid Management ([[Tools|Operational Tools]])=  
(Task leader: E. Imamagic - SRCE)
(Task leader: E. Imamagic - SRCE)
* (NGI) Deployment (system management and validation) of regional instance of operational tools:
* (NGI) Deployment (system management and validation) of regional instance of operational tools:
Line 47: Line 50:
** GOCDB (STFC), operations portal and dashboard C. L'Orphelin (IN2P3), central Nagios-based SAM components and MyEGI W. Lapka (CERN), central messaging infrastructure C. Kanellopoulos (AUTH), central network monitoring tools (downcollector) M. Reale (GARR)
** GOCDB (STFC), operations portal and dashboard C. L'Orphelin (IN2P3), central Nagios-based SAM components and MyEGI W. Lapka (CERN), central messaging infrastructure C. Kanellopoulos (AUTH), central network monitoring tools (downcollector) M. Reale (GARR)
** Migration plan to EGI.eu domain, failover configurations
** Migration plan to EGI.eu domain, failover configurations
** Coordination of regional deployment of tools (e.g. Nagios)  
** Coordination of regional deployment of tools (e.g. Nagios)


= Task 5: Accounting =  
= Task 5: Accounting =  
(Task leader: J. Gordon, STFC)
(Task leader: A. Packer, STFC)
* (NGI) Deployment (system management and validation) of regional instance of accounting infrastructure:
* (NGI) Deployment (system management and validation) of regional instance of accounting infrastructure:
** Regional accounting portal  
** Regional accounting portal  
** Regional accounting repository  
** Regional accounting repository  
** Check for correctness and completeness of accounting information
** Check for correctness and completeness of accounting information
** Chaise sites not publishing accounting records
** Chase sites not publishing accounting records
** Provide feedback on accounting requirements  
** Provide feedback on accounting requirements  


* (EGI Global task)
* (EGI Global task)
** Central accounting repository - G. Mathieu (STFC)
** Central accounting repository - J. Gordon (STFC)
** Central accounting portal - J. Lopez Cacheiro (CESGA)
** Central accounting portal - I. Díaz Álvarez (CESGA)
** Gathering new accounting requirements  
** Gathering new accounting requirements


= Task 6: Helpdesk Infrastructure =  
= Task 6: Helpdesk Infrastructure =  
(Task leader: T. Antoni - KIT)
(Task leader: G. Grein - KIT)


* (NGI) Regional helpdesk
* (NGI) Regional helpdesk
Line 74: Line 77:
*** provide an FAQ based on a template provided by GGS which describes your NGI support infrastructure, a contact email address  
*** provide an FAQ based on a template provided by GGS which describes your NGI support infrastructure, a contact email address  
*** people concerned in the helpdesk activities must [https://gus.fzk.de/admin/get_account.php?accounttype=support register] as GGUS support staff here:
*** people concerned in the helpdesk activities must [https://gus.fzk.de/admin/get_account.php?accounttype=support register] as GGUS support staff here:
** Provide feedback about helpdesk requirements in the framework of the [https://wiki.egi.eu/wiki/USAG USAG] group
** Provide feedback about helpdesk requirements in the framework of the [[USAG]] group
   
   
* (EGI Global task) Deployment of central EGI helpdesk (GGUS):
* (EGI Global task) Deployment of central EGI helpdesk (GGUS):
Line 87: Line 90:
* (NGI) Regional support
* (NGI) Regional support
** 1st and 2nd line support to national users and NGI sites
** 1st and 2nd line support to national users and NGI sites
** Grid oversight of an NGI (aka ROD)
** Grid oversight of an NGI (aka [[Grid_operations_oversight|ROD]])
** NGI network support  
** NGI network support  


* (EGI Global task)  Central support
* (EGI Global task)  Central support
** Grid oversight (COD) – Netherlands, Poland following up of escalated operational issues
** Grid oversight ([[Grid_operations_oversight|COD]]) – Netherlands, Poland following up of escalated operational issues
** Site suspension
** Site suspension
** Follow up of sites failing to provide 70%/75% monthly availability/reliability
** Follow up of sites failing to provide 70%/75% monthly availability/reliability
** Ticket Processing Management (TPM) – first line support, coord: H. Dres (KIT), A. Paolini (INFN)
** Ticket Processing Management (TPM) – first line support, coord: H. Dres (KIT), A. Paolini (INFN)
** Coordination of network support - M. Reale (GARR)
** Coordination of network support - M. Reale (GARR)
* (EGI Global task) Software support
(Leader: A. Krenek - CESNET)


= Task 8 Providing a Reliable Grid Infrastructure =  
= Task 8 Providing a Reliable Grid Infrastructure =  
(Task leader: C. Kanellopoulos, AUTH)
(Task leader: Paschalis Korosoglou, AUTH)


* NGI:
* NGI:
** Deployment of NGI core middleware services (top-BDII, FTS, WMS/LB, central LFC, VOMS)
** Deployment of NGI core middleware services (top-BDII, FTS, WMS/LB, central LFC, VOMS)
** Operation of national CA
** Operation of national CA
** Follow up of availability issues in the country
** [[Performance|Follow up of availability issues]] in the country
** Contribute to the maintenance and development of documentation, procedures and best practices
** Contribute to the maintenance and development of documentation, procedures and best practices
** Site certification  
** Site certification  


* (EGI Global Task)
* (EGI Global Task - [[Catch_All_Grid_Core_Services|Catch-all Services]])
** Deployment of central grid middleware services (aka catch-all) - C. Kanellopoulos (AUTH)  
** Deployment of central grid middleware services (aka catch-all) - C. Kanellopoulos (AUTH)  
** Dteam VOMS C. Kanellopoulos (AUTH)  
** Dteam VOMS C. Kanellopoulos (AUTH)  
Line 115: Line 121:
** Validation and distribution of monthly availability/reliability statistics
** Validation and distribution of monthly availability/reliability statistics
** Enhancement/extensions of Operational Level Agreements
** Enhancement/extensions of Operational Level Agreements
** Maintenance and development (coordination of) operational documentation, procedures, best practices - V. Hansper (CSC)
** Maintenance and development (coordination of) operational documentation, procedures, best practices - M. Krakowian (EGI.eu)

Latest revision as of 22:26, 24 December 2014

EGI Inspire Main page



This page explains how operational activities map to EGI-InSPIRE SA1 tasks and provides guidance to SA1 staff on how to report activities and effort. For a complete overview of EGI InSPIRE activities, please refer to the EGI-InSPIRE Description of Work.

Task 1: Activity Management

(Task leader: T. Ferrari, EGI.eu)

  • (NGI) participation to the Operations Management Board (OMB)
  • (NGI) participation to bi-weekly operations meetings
  • (NGI and EGI.eu) Review of SA1 and non-SA1 milestones and deliverables

Task 2: A Secure Infrastructure

(Task leader: D. Kelsey - STFC)

  • (NGI) participation to EGI CSIRT. EGI CSIRT covers all aspects of operational security aimed at achieving a secure infrastructure within EGI. It ensures both the coordination with peer grids and with the NGIs and NREN CSIRTs.
    • Incident Response Task Force (IRTF) Handle day to day operational security issues, coordinates Computer-Security-Incident-Response
    • Security Drills Group (SDG) testing of the inter CSIRT communication channels and assessing the Site-Security-Teams readiness to handle IT-security incidents.
    • Security Monitoring Group (SMG) Develop, deploy and maintain security monitoring tools.
    • Training and Dissemination Group (TDG) Raise security awareness and improve security for system administrators by providing security training and best practice
  • (NGI) participation to the EGI Software Vulnerability Group (SVG). The purpose of the EGI Software Vulnerability Group is to eliminate existing vulnerabilities from the deployed infrastructure, primarily from the grid middleware, prevent the introduction of new ones and prevent security incidents.
  • (EGI Global task) Coordination of EUGridPMA - D. Groep, Nikhef
  • (EGI Global task) Coordination of operational security - M. Ma, STFC

Task 3: Software Deployment and Interoperations

(Task leader: J. Pina - LIP, M. Krakowian - EGI.eu)

  • (NGI) Participation to staged rollout of middleware component updates
  • (NGI) Interoperability at the NGI level: operation of multiple middleware stacks and novel resources (virtualization, HPC, etc.), interoperability with regional grids, feeding interoperability requirements into EGI
  • (EGI global task) Definition and implementation of a new workflow to automate software deployment cycle, from release of the sw developers to deployment
  • (EGI global task) Coordination of the staged rollout activities carried out by the NGIs
  • (EGI global task) Liaison with gLite Collaboration release team (interim)
  • (EGI global task) Chairing the operations meetings
  • (EGI global task) Operational interoperation between NGIs and with international Grids, e.g. of the accounting and monitoring infrastructure, gathering or requirements

Task 4: Infrastructure for Grid Management (Operational Tools)

(Task leader: E. Imamagic - SRCE)

  • (NGI) Deployment (system management and validation) of regional instance of operational tools:
    • regional Nagios and MyEGI portal (mandatory)
    • regional GOCDB (prototype available)
    • regional operations portal and dashboard
    • regional accounting portal (prototype available)
    • Provide feedback on tool deployment issues and requirements
  • (EGI global task) Deployment of central operational tools
    • GOCDB (STFC), operations portal and dashboard C. L'Orphelin (IN2P3), central Nagios-based SAM components and MyEGI W. Lapka (CERN), central messaging infrastructure C. Kanellopoulos (AUTH), central network monitoring tools (downcollector) M. Reale (GARR)
    • Migration plan to EGI.eu domain, failover configurations
    • Coordination of regional deployment of tools (e.g. Nagios)

Task 5: Accounting

(Task leader: A. Packer, STFC)

  • (NGI) Deployment (system management and validation) of regional instance of accounting infrastructure:
    • Regional accounting portal
    • Regional accounting repository
    • Check for correctness and completeness of accounting information
    • Chase sites not publishing accounting records
    • Provide feedback on accounting requirements
  • (EGI Global task)
    • Central accounting repository - J. Gordon (STFC)
    • Central accounting portal - I. Díaz Álvarez (CESGA)
    • Gathering new accounting requirements

Task 6: Helpdesk Infrastructure

(Task leader: G. Grein - KIT)

  • (NGI) Regional helpdesk
    • Deployment of regional helpdesk interworking with GGUS (optional)
      • set up a regional ticketing system, make use of a xGUS regional, instance or use GGUS directly
      • ensure the stable operation of the regional ticketing system, if applicable
      • get in contact with GGUS staff support at ggus.org to clarify the specification of the interface with GGUS or your xGUS customisation
      • register your NGI as a new SU at the GGUS Savannah or open a GGUS ticket for this
      • provide an FAQ based on a template provided by GGS which describes your NGI support infrastructure, a contact email address
      • people concerned in the helpdesk activities must register as GGUS support staff here:
    • Provide feedback about helpdesk requirements in the framework of the USAG group
  • (EGI Global task) Deployment of central EGI helpdesk (GGUS):
    • System management of the tool and deployment of new releases
    • Implementation of failover configurations
    • Maintenance of GGUS Support Units
    • Validation of GGUS ticket reports

Task 7: Support Teams

(Task leader: Ron Trompert, SARA)

  • (NGI) Regional support
    • 1st and 2nd line support to national users and NGI sites
    • Grid oversight of an NGI (aka ROD)
    • NGI network support
  • (EGI Global task) Central support
    • Grid oversight (COD) – Netherlands, Poland following up of escalated operational issues
    • Site suspension
    • Follow up of sites failing to provide 70%/75% monthly availability/reliability
    • Ticket Processing Management (TPM) – first line support, coord: H. Dres (KIT), A. Paolini (INFN)
    • Coordination of network support - M. Reale (GARR)
  • (EGI Global task) Software support

(Leader: A. Krenek - CESNET)

Task 8 Providing a Reliable Grid Infrastructure

(Task leader: Paschalis Korosoglou, AUTH)

  • NGI:
    • Deployment of NGI core middleware services (top-BDII, FTS, WMS/LB, central LFC, VOMS)
    • Operation of national CA
    • Follow up of availability issues in the country
    • Contribute to the maintenance and development of documentation, procedures and best practices
    • Site certification
  • (EGI Global Task - Catch-all Services)
    • Deployment of central grid middleware services (aka catch-all) - C. Kanellopoulos (AUTH)
    • Dteam VOMS C. Kanellopoulos (AUTH)
    • VOMS service for VOs requesting it C. Kanellopoulos (AUTH)
    • Central Nagios infrastructure for security monitoring
    • Catch all CA
    • Validation and distribution of monthly availability/reliability statistics
    • Enhancement/extensions of Operational Level Agreements
    • Maintenance and development (coordination of) operational documentation, procedures, best practices - M. Krakowian (EGI.eu)