
EGI-InSPIRE:SA1.7-QR4

Revision as of 14:39, 2 May 2011 by Ron (talk | contribs)

1. Task Meetings

Date (dd-mm-yyyy) | Indico agenda URL | Title | Outcome
26-01-2010 | https://www.egi.eu/indico/conferenceDisplay.py?confId=315 | CODOC meeting with COO | https://www.egi.eu/indico/getFile.py/access?resId=0&materialId=minutes&confId=315
26-01-2010 | https://www.egi.eu/indico/conferenceDisplay.py?confId=314 | CODOC | https://www.egi.eu/indico/getFile.py/access?resId=0&materialId=minutes&confId=314

2. Main Achievements

Grid Oversight

ROD teams news letter

The transition from EGEE to EGI-InSPIRE brought many changes. For Operations, the EGEE Regional Operations Centres (ROCs) are being dismantled and their responsibilities transferred to the NGIs; some have already completed this process. In the EGI era, ROD teams will monitor the quality of sites in their country or region, whereas COD is responsible for global oversight of the whole EGI infrastructure. The aim is to provide a high-quality grid infrastructure to the user communities.

These changes have also led us to think about how COD and ROD are going to interact in this new setting. During the Grid Oversight session at the EGI Tech Forum it was made clear to us that people find it cumbersome to travel to regular face-to-face meetings. Nevertheless, we feel the need to create and maintain a coherent and lively Grid Oversight community, with interaction between ROD and COD that goes beyond the dashboards. In our view, this is necessary to create a top-quality grid infrastructure for our users.

For this reason we have created this newsletter. Its purpose is to inform you about recent and upcoming developments related to Grid Oversight and to show you the metrics indicating how well we did in the past month. We have published newsletters since December 2011 and will continue to do so on a monthly basis.

ROD session at EGI UF

At the EGI User Forum in Vilnius, we organised a ROD teams session with four presentations. In the first, Marcin Radecki discussed the Grid Oversight work. In the second, Gonçalo Borges from NGI IBERGRID gave a very good presentation on IBERGRID operations and their experience with the regionalised operational tools. Finally, a slot on operational tools had two presentations: Cyril L'Orphelin on the status and roadmap of the Operations Portal, and Emir Imamagic on the SAM roadmap. The presentations can be downloaded from: https://www.egi.eu/indico/sessionDisplay.py?sessionId=9&confId=207#20110411. We were very pleased that no fewer than 35 people attended this session.

Tutorial videos

The COD team has started using video to pass information to ROD members. You can now learn your duties by watching our video tutorials! The series will contain six parts:

   1. How to become a ROD member – the 7 steps required to become a ROD member
   2. Operations tools – a brief introduction to the operations tools a ROD member needs to perform their duties
   3. How to handle alarms – instructions on how to manage alarms on the Operations Portal (creating tickets, closing and masking alarms)
   4. How to handle tickets – instructions on how to manage tickets on the Operations Portal (creating, updating and closing tickets)
   5. Issues escalated to COD – an introduction to the cases that are escalated to COD and how to deal with them
   6. Operations portal – a brief introduction to the Operations Portal tools

Currently the first two videos are available; links to them are on the ROD wiki page: https://wiki.egi.eu/wiki/Grid_operations_oversight/ROD#Videos_tutorials. All videos will be uploaded to YouTube soon.

TPM

TPM activity is carried out by two teams that are in permanent contact, so no extra meetings are required to organise the daily work. TPM can be considered a very reliable service. A prototype of the Technology Helpdesk (EMI/IGE/SAGA) was presented in Vilnius. It is a separate GGUS instance that deals with middleware-related tickets. TPM should be able to identify these tickets and assign them to DMSU.

Network Support

3. Issues and Mitigation

Issue Description Mitigation Description
Grid Oversight: None
TPM: None

4. Plans for the next period

Grid Oversight

1. Continue ROC transition to NGIs.

2. Continue investigating the impact of new middleware in EGI on the operations support model.

3. Continue investigating how to improve availability and reliability metrics.

4. Evaluate upcoming new releases of the operational dashboard.

5. Finish the tutorial videos.

TPM

Plans will be worked out to further automate the TPM's work and to improve the monitoring of untouched tickets. A workshop with all people involved in the TPM task could help with this. In preparation for this workshop, ticket statistics will be collected and analysed.


Network Support