Difference between revisions of "EGI Infrastructure operations oversight"
Revision as of 18:18, 28 January 2011
EGI.eu Operations Oversight Pages
EGI Grid Operations oversight of the e-Infrastructure is a co-ordination task for ensuring that Grid monitoring across EGI runs smoothly. The team handles communication among three groups: Operations and e-Infrastructure Oversight (OE); Operational Documentation (OD); and "Coordination of interoperations between NGIs and with other Grids".
The Operations oversight team works with the Tool Developers (particularly the OTAG group), the NGIs, and their Operations Teams (ROD). The co-ordinators and others working on the tasks hold regular phone meetings. The OE co-ordinators also organise face-to-face meetings for the ROD teams three to four times a year.
- COD managers:
  - Ron Trompert (Chair), Marcin Radecki, Luuk Uljee, Malgorzata Krakowian
- COD shifters:
  - Malgorzata Krakowian, Ron Trompert, Luuk Uljee, Maarten van Ingen, Ernst Pijper, Alexander Verkooijen
- Contact:
  - There are three mailing lists, each used for a different purpose:
    - manager-central-operator-on-duty AT mailman.egi.eu - for COD managerial issues, such as suggesting changes to procedures or tools. COD managers are the recipients of this list.
    - central-operator-on-duty AT mailman.egi.eu - for reporting COD day-to-day issues, such as problems with tools or Nagios tests. COD shifters are the recipients of this list.
    - all-central-operator-on-duty AT mailman.egi.eu - for contacting all ROD teams in the NGIs. Every ROD team is a recipient of this list.
- Activities:
  - COD managers
    - representing RODs/COD in OTAG and OMB - collecting requirements and improvement proposals from RODs concerning operations tools and procedures
    - taking part in the OLA task force
    - writing new procedures - COD takes part in the procedure creation process when needed
    - preparing ROD newsletters - informing RODs about recent and upcoming developments related to Grid Oversight
    - preparing ROD metrics reports - providing an overview of the operations support process in the grid infrastructure
  - COD shifters
    - escalation of operational problems with RODs
    - dealing with GGUS tickets assigned to COD
    - process coordination of:
      - creation and decommissioning of an Operations Centre
      - setting a Nagios test as an operations test
      - getting explanations for low availability and reliability metrics
COD official web pages
Internal area
COD shifters daily work instructions
This section collects all work instructions. Each one specifies exactly which steps to follow to carry out an activity.
Action | Description | Related procedures |
---|---|---|
GGUS tickets assigned to COD | The COD shifter must check the current status of all GGUS tickets assigned to COD. If the shifter does not know what action to take, he/she should contact the COD managers. | |
Availability/reliability reports | | |
Operational portal dashboard issues | | |
Handover | | |
NOTE: all procedures should contain the following template: https://wiki.egi.eu/wiki/PDT:Procedure_Template