General information
- the Operations meeting will be on the 2nd Monday of the month
- the EGI Operations Meeting schedule for first half of 2016 is available on Indico: https://indico.egi.eu/indico/categoryDisplay.py?categId=32 and on the new summary page: https://wiki.egi.eu/wiki/Operations_Meeting
UMD/CMD/Preview
- cloud middleware release 0.0.0 (pilot, only for testing purposes) is OK
- we are starting to include new products in the first release, CMD-OS 1.0.0 (OpenStack Mitaka); products to be inserted:
- keystone-VOMS
- ooi
- cloud BDII information provider
- https://wiki.egi.eu/wiki/EGI_Cloud_Middleware_Distribution_products -> product ID cards almost ready
- UMD3 and UMD4 canL/Gridsite emergency releases
- UMD4/CentOS7 regular update in preparation (October release, 4.3.0)
- UI/WN for CentOS7 -> ONGOING
- ARGUS, DPM, lcas/lcas-lcmaps-gt4, davix, glexec, edg-mkgrid, ARC, XROOTD, GFAL2
- please start using UMD4/SL6 or UMD4/CentOS7 instead of UMD3/SL6
- Debian not used anymore, SL5 only security fixes, SL6 is available in UMD4 as well
- UMD4/SL6 contains the UMD3/SL6 products that will be supported for at least the next year; unsupported products are not included in UMD4/SL6 (please let us know if we have skipped a specific product that you need!)
- for some unsupported products, we are investigating how to replace them with equivalent products in UMD4/SL6 (see WMS)
- we need to provide a list of all the products that are in UMD3 but not migrated to UMD4
- please don't use EMI3 anymore, use Preview instead!
- EMI3 is not supported anymore; a new message on the EMI3 web pages will soon redirect from EMI3 to UMD/Preview
- more information will soon appear on http://repository.egi.eu/
- Wiki update in progress
- https://wiki.egi.eu/wiki/UMD_Release_Schedule
- https://wiki.egi.eu/wiki/UMD_products_ID_cards (only UMD4/CentOS7, UMD4/SL6 will be added too, while UMD3 products still on the old page https://wiki.egi.eu/wiki/URT:UMD_products_ID_cards)
Preview repository
Generic information about Preview repository: https://wiki.egi.eu/wiki/Preview_Repository
- latest updates released on 2016-09-14:
- Preview 1.4.0 AppDB info (SL6): ARC 15.03 update 9, FTS 3 - 3.4.7, VOMS
- Preview 2.4.0 AppDB info (CentOS 7): ARC 15.03 update 9, dCache 2.13.42, edg-mkgridmap 4.0.3, FTS 3 - 3.4.7, QCG-Computing 4.0.0, QCG-Notification 4.0.0, UI and WN
Note: EGI provides the Preview repository without any additional quality assurance process; the products are released as they are provided by the product teams. EGI recommends the use of the UMD repositories, which contain software verified through the quality assurance process of UMD.
Operations
Software upgrades for OpenStack cloud RCs
- keystone-VOMS and cloud-info-provider updates available, need to be installed on all OpenStack sites
- as the latest keystone-VOMS version is only compatible with Liberty and Mitaka, sites running OpenStack Kilo (or older) have been asked for an OpenStack upgrade plan
- according to EGI policy (https://wiki.egi.eu/wiki/PROC16_Decommissioning_of_unsupported_software), OpenStack Kilo or older should NOT be running on the infrastructure! We are asking to discuss this point at the next OMB (October 27)
- as many sites are finding it difficult to plan upgrades against the very tight release cycle of OpenStack, please come with suggestions and reply with details in the tickets in order to shape the best (shared) proposal
- ticket campaign ONGOING for all OpenStack sites, see e.g. https://ggus.eu/?mode=ticket_info&ticket_id=124217 (this example is already solved)
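The compatibility rule above can be sketched as a small check. This is only an illustration: the helper name `needs_openstack_upgrade` and the site inventory are hypothetical, and the set of compatible releases is taken from the notes above rather than from the keystone-VOMS documentation.

```python
# OpenStack releases considered here, oldest first (public release names).
KNOWN_RELEASES = ["icehouse", "juno", "kilo", "liberty", "mitaka"]

# Releases the latest keystone-VOMS works with, as stated in these notes.
KEYSTONE_VOMS_COMPATIBLE = {"liberty", "mitaka"}


def needs_openstack_upgrade(release: str) -> bool:
    """Return True if OpenStack must be upgraded before installing the
    latest keystone-VOMS (hypothetical helper, not an EGI tool)."""
    name = release.strip().lower()
    if name not in KNOWN_RELEASES:
        raise ValueError(f"unknown OpenStack release: {release!r}")
    return name not in KEYSTONE_VOMS_COMPATIBLE


# Hypothetical site inventory, for illustration only.
sites = {"SITE-A": "Kilo", "SITE-B": "Liberty", "SITE-C": "Mitaka"}
to_upgrade = sorted(s for s, r in sites.items() if needs_openstack_upgrade(r))
print(to_upgrade)
```

An NGI could feed such a check with the releases reported in its tickets to see which sites still need an upgrade plan.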
canL/gridsite upgrades available
- please upgrade your services as soon as possible if you have received the CSIRT communication https://wiki.egi.eu/wiki/SVG:Advisory-SVG-2016-11476
- also WNs must be upgraded!
- please don't miss replying to CSIRT emails!
Monthly Availability/Reliability
- September A/R figures are not definitive yet
- Underperforming sites according to the August A/R report:
- CERN: SRM servers overloaded, low A/R figures since June https://ggus.eu/index.php?mode=ticket_info&ticket_id=122596
- AfricaArabia https://ggus.eu/?mode=ticket_info&ticket_id=123806
- ZA-UJ no feedback yet
- NGI_DE https://ggus.eu/index.php?mode=ticket_info&ticket_id=123836
- UNI-DORTMUND (NGI_DE): migration to new site-bdii and CREAM-CE
- TUDresden-ZIH: no feedback yet, underperforming for two months, eligible for suspension
- NGI_GRNET https://ggus.eu/index.php?mode=ticket_info&ticket_id=123839
- waiting for the September report for closing the ticket
- NGI_IT:
- INFN-CAGLIARI https://ggus.eu/index.php?mode=ticket_info&ticket_id=123531 CE problems, working on it
- INFN-NAPOLI-CMS https://ggus.eu/index.php?mode=ticket_info&ticket_id=123840 no feedback yet
- NGI_MARGI: monitoring data missing https://ggus.eu/index.php?mode=ticket_info&ticket_id=118465
- NGI_NL https://ggus.eu/?mode=ticket_info&ticket_id=123532
- BelGrid-UCL: SRM failures, issues with the CREAM probe, recomputation requested
- NGI_UK https://ggus.eu/index.php?mode=ticket_info&ticket_id=122614
- UKI-SOUTHGRID-SUSX: statistics are improving after SRM and CREAM issues
- ROC_Russia https://ggus.eu/index.php?mode=ticket_info&ticket_id=123536
- Ru-Troitsk-INR-LCG2 Network issues
Decommissioning SL5
- Sites still deploying unsupported service end-points risk suspension, unless documented technical reasons prevent a Site Admin from updating these end-points https://wiki.egi.eu/wiki/PROC16_Decommissioning_of_unsupported_software#Escalation_phase see step 7
- GGUS tickets, managed directly by EGI Operations, track the status of each specific case
- status on July 14th with tickets table https://wiki.egi.eu/wiki/Agenda-18-07-2016#Decommissioning_SL5
- WMS on SL5 running at IN2P3-CPPM
- it should have been upgraded in July, but this was never done: GGUS 124338
- decommissioning procedure just started GGUS 124340
- StoRM on SL5 running at INFN-PARMA
- it was in decommissioning, downtime concluded but it has not been suspended yet GGUS 124337
- SCAI (NGI_DE): services decommissioned GGUS 122894
- some other cases are handled by the NGIs (one site reported for NGI_UK, decommissioning of SL5 will happen at the end of October)
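Several tickets above are given only as "GGUS <number>". A minimal sketch for turning such references into links, assuming the URL pattern used by the full ticket links elsewhere in these notes; the function name `ggus_ticket_url` is hypothetical.

```python
import re

# URL pattern copied from the ticket links quoted elsewhere in these notes.
GGUS_TICKET_URL = "https://ggus.eu/?mode=ticket_info&ticket_id={ticket_id}"


def ggus_ticket_url(reference: str) -> str:
    """Build a GGUS ticket link from 'GGUS 124338' or a bare '124338'
    (hypothetical helper)."""
    match = re.fullmatch(r"\s*(?:GGUS\s+)?(\d+)\s*", reference)
    if match is None:
        raise ValueError(f"not a GGUS ticket reference: {reference!r}")
    return GGUS_TICKET_URL.format(ticket_id=match.group(1))


# Ticket references listed in the SL5 decommissioning items above.
for ref in ("GGUS 124338", "GGUS 124340", "GGUS 124337", "GGUS 122894"):
    print(ref, "->", ggus_ticket_url(ref))
```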
ARGO proposal to use GOCDB as the only source of topology information
VAPOR
- new version released in September, it replaces GSTAT
- important for presenting the amount of computing and storage resources of the infrastructure
- each NGI should review the information provided by their sites and let us know of any inconsistency: http://operations-portal.egi.eu/vapor/resources/GL2ResSummary
- we need your feedback to improve the service
- some known issues will be fixed in the next release
AOB
Next meeting
- 07th Nov 2016 https://indico.egi.eu/indico/event/3007/
- new calendar available until end of 2016 https://indico.egi.eu/indico/category/32/