EGI-InSPIRE:SA1.7-QR6
1. Task Meetings
Date (dd/mm/yyyy) | Url Indico Agenda | Title | Outcome |
---|---|---|---|
2. Main Achievements
Grid Oversight
ROD teams newsletter
This quarter we published ROD teams newsletters in August and October. The rationale behind the newsletter is described in the QR4 report.
ROD teams questionnaire
Some time ago we sent out a questionnaire to you. We wanted to have your own opinion on how you perceive your work, and to know your opinion on the operational tools, documentation, video tutorials, this newsletter, etcetera. We received no fewer than 44 responses, which we found very valuable. From 12 NGIs we received more than one response. Thank you very much for taking the time to answer the questions. The outcome was discussed during the Grid Oversight session at the EGI Tech Forum (https://www.egi.eu/indico/getFile.py/access?contribId=35&resId=0&materialId=slides&confId=452).
ROD session at EGI TF
At this edition of the EGI Tech Forum we organised a 1.5-hour session covering three topics. First, COD staff presented the new simplified escalation procedure that came into effect on October 1st. The ROD metrics and their incorporation into the OLA were also discussed, and this topic caused a fair amount of discussion. The outcome was that these metrics will continue to be collected and published in this newsletter; the discussion on how they should enter the OLA will be restarted later. Finally, the results of an investigation into the reasons for closing alarms in non-OK status were presented, along with some tips on how to do this properly. Next, COD staff presented the results of the survey that we held among our RODs about the work that they do. The survey included questions about the operational tools, documentation, etcetera, and the COD provided its feedback on this in this slot. Fortunately, the operational tools developers Cyril l’Orphelin and Emir Imamagic were in the audience, so part of the slot became a very useful Q&A session between users of the operational tools and the developers.
Finally, Cyril l’Orphelin gave an interesting presentation on the recent developments and improvements of the dashboard. A security dashboard is planned that will detect security issues and inform sites about them. There will also be a VO-oriented dashboard. Links to the presentations may be found at: https://www.egi.eu/indico/contributionDisplay.py?contribId=35&confId=452
Procedures
Availability followup
- A Nagios probe is under development that will raise an alarm when a site's availability and/or reliability is below the 70%/75% threshold. The COD has provided input, which was added to the RT ticket: https://rt.egi.eu/rt/Ticket/Display.html?id=289. We organised a phone conference on the requirements that this probe should fulfil. We have made a new proposal in this area and hope to reach an agreement with all parties involved so that this issue can make some progress.
- Unknown issue
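
As an illustration of the availability/reliability threshold check mentioned for the Nagios probe, the core logic might look roughly like the sketch below. This is not the probe under development: the function name `check_site` and the message format are hypothetical, and the reading of "70%/75%" as 70% for availability and 75% for reliability is an assumption based on the order in the text. Only the exit codes follow the usual Nagios plugin convention (0 = OK, 2 = CRITICAL, 3 = UNKNOWN).

```python
# Hypothetical sketch only; not the actual probe discussed in the RT ticket.

# Conventional Nagios plugin exit codes.
OK, CRITICAL, UNKNOWN = 0, 2, 3

# Assumed mapping of the "70%/75%" threshold: availability / reliability.
AVAILABILITY_THRESHOLD = 70.0  # percent
RELIABILITY_THRESHOLD = 75.0   # percent


def check_site(availability, reliability):
    """Return (exit_code, message) for one site's monthly figures in percent."""
    if availability is None or reliability is None:
        return UNKNOWN, "UNKNOWN - no availability/reliability data"

    problems = []
    if availability < AVAILABILITY_THRESHOLD:
        problems.append("availability %.1f%% < %.0f%%"
                        % (availability, AVAILABILITY_THRESHOLD))
    if reliability < RELIABILITY_THRESHOLD:
        problems.append("reliability %.1f%% < %.0f%%"
                        % (reliability, RELIABILITY_THRESHOLD))

    if problems:
        # Either figure below its threshold raises the alarm ("and/or").
        return CRITICAL, "CRITICAL - " + "; ".join(problems)
    return OK, "OK - availability and reliability above thresholds"
```

A real probe would additionally need to obtain the figures from the availability computation and report the result to Nagios; those parts depend on the requirements still being agreed and are left out here.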
TPM
Network Support
3. Issues and Mitigation
Issue Description | Mitigation Description |
---|---|