Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "SL5 retirement"

From EGIWiki
Jump to navigation Jump to search
 
(17 intermediate revisions by 5 users not shown)
Line 40: Line 40:
*ARC
*ARC
*UNICORE
*UNICORE
* '''don't forget to upgrade the apel client host if on SL5, we count on it to stop brokers accepting SSLv3'''


== 2016-04-18 Overall status ==
== 2016-04-18 Overall status ==
Line 48: Line 49:
* 21 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_WMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 WMS] and 11 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_LB&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 LB]
* 21 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_WMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 WMS] and 11 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_LB&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 LB]
* 8 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_VOMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 VOMS]
* 8 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_VOMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 VOMS]
* 2 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_emi.ARGUS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 ARGUS]
* 27 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_CREAM-CE&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 CREAM-CE]
* 2 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_QCG.Computing&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 QCG Computing]
* 6 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_SRM&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 STORM]
== 2016-05-09 Overall status ==
* 3 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_Top-BDII&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 Top-BDII]
* 22 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_Site-BDII&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 Site-BDII]
* 6 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_MyProxy&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 MyProxy]
* 13 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_WMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 WMS] and 11 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_LB&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 LB]
* 3 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_VOMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 VOMS]
* 2 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_emi.ARGUS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 ARGUS]
* 19 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_CREAM-CE&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 CREAM-CE]
* 0 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_QCG.Computing&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 QCG Computing]
* 4 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_SRM&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 STORM]
== 2016-06-13 Overall status ==
* 2 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_Top-BDII&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 Top-BDII]
* 15 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_Site-BDII&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 Site-BDII]
* 5 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_MyProxy&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 MyProxy]
* 7 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_WMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 WMS] and 7 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_LB&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 LB]
* 2 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_VOMS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 VOMS]
* 1 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_emi.ARGUS&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 ARGUS]
* 12 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_CREAM-CE&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 CREAM-CE]
* 0 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_QCG.Computing&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 QCG Computing]
* 2 [https://midmon.egi.eu/nagios/cgi-bin/status.cgi?servicegroup=SERVICE_SRM&style=detail&servicestatustypes=16&hoststatustypes=15&serviceprops=0&hostprops=0 STORM]
* '''from this week on EGI Operations will start suspending sites that host services in production with SL5 and not set under downtime'''


== Reports by NGIs  ==
== Reports by NGIs  ==
Line 54: Line 86:


===AfricaArabia===
===AfricaArabia===
No report.
Only the regional SAM-Nagios box. This will be decommissioned as soon as the central monitoring is reliable.


===AsiaPacific===
===AsiaPacific===
No report.
Our NGI Nagios (sam-nagios) server is still SL5 (We tried to build an SL6 one but failed due to lack of sam-nagios SL6 package. I wonder if SL6 sam-nagios is already available now.)
 
Also, these sites: HK-HKU-CC-01, INDIACMS-TIFR, and IR-IPM-HEP still have SL5 servers.


===CERN===
===CERN===
Line 129: Line 163:


===NGI_PL===
===NGI_PL===
No report.
[Apr29]
According to our survey, all sites should meet the deadline.
 
National holidays, and "long weekend" starting 30.04-03.05. In case some sites fail tests on 01.05, please give us a little bit more time till 06.05, before starting suspension procedure :)
 
For sure, we do not upgrade Nagios boxes (6 instances including vo nagioses and test instances).


===NGI_RO===
===NGI_RO===
Line 138: Line 177:


===NGI_SK===
===NGI_SK===
Only NGI instances of SAM-nagios are on SL5.
No SL5 services.


===NGI_TR===
===NGI_TR===
No report.
Only NGI instances of SAM-nagios are on SL5.


===NGI_UA===
===NGI_UA===
Line 147: Line 186:


===NGI_UK===
===NGI_UK===
DPM:
Non-Squid/DPM services:


Several sites: we’d like to clarify what timescale would be appropriate if we wanted to decommission SL5 pool/head nodes over the coming months rather than migrate.
In progress:


Squids:
GLASGOW:


With a note that squids are not covered in the probes/exceptions: are these included (some sites have SL5 squids)?
- WMS/LB: Being decommissioned following PROC12: tracked at https://ggus.eu/?mode=ticket_info&ticket_id=120973


Non-Squid/DPM services:
LANCASTER:


BRUNEL:
- BDII: SL5 BDII due to be upgraded


- BDII: SL5 BDII due for retirement
OXFORD:


RHUL:
- SAM/VO Nagios SL5 only as with other NGIs (as discussed)


- CREAM
T1:
- BDII: CentOS 7 being investigated


BRISTOL:
Castor SRM systems. SRM upgrade waiting on Castor upgrade.


- CREAM: 1 VMHost, 2 x CREAM. CREAM to be moved to HTCondor CE (end of May)
Complete:


GLASGOW:
BRUNEL:


- WMS/LB: Due for decommissioning shortly
- BDII: SL5 BDII retired


LANCASTER:
RHUL:


- BDII: SL5 BDII due for upgrade after Easter
- CREAM & BDII upgraded


OXFORD:
BRISTOL:


- SAM/VO Nagios SL5 only as with others.
- SL5 CREAM decommissioned


T1:
===ROC_Canada===


Castor SRM systems. Small number of internal service machines. Database systems on Red Hat Enterprise 5. Timeline: SRMs and others - still to be defined.
Only the ROC SAM-Nagios box is still with SL5 This will be decommissioned as soon as the central monitoring is reliable.
 
===ROC_Canada===
No report.


===ROC_LA===
===ROC_LA===
Line 193: Line 228:


===Russia===
===Russia===
No report.
[Apr27] the last SL5-powered service in Russian region was reinstalled into CentOS 6 this week, https://goc.egi.eu/portal/index.php?Page_Type=Downtime&id=20542
so ROC_Russia now is free of SL5 machines visible via midmon.  There are 3 more disk pools running SL5 over the whole region, we're trying to follow up on their upgrade, though it will take some time.

Latest revision as of 08:08, 10 October 2016


The aim of this page is tracking the progressive decommissioning of SL5 from sites.

UMD is getting ready to support CentOS7. Supporting a new platform will require to decommission a previously support O.S. to make available new resources. EGI Operations will like to decommission Scientific Linux 5.

Please start tracking which sites are still using SL5 services: how many services, and for each service if still needed on SL5, if upgrades on SL5 services are expected). Also interesting to understand who is using Debian.

From September 2015 on, UMD3 will support only security fixes for SL5. Security updates will be available until March 2016.

All SL5 services must be decommissioned by April 2016. On February probes will start warning about SL5 services to be decommissioned.

Description of the process

Log

[2016-03-01] Request to turn WARNINGs into CRITICAL: https://ggus.eu/?mode=ticket_info&ticket_id=118901#update#21

[2016-02-08] SL5 Tests ready on midmon, with corresponding documentation

[2016-01-15] Preparation of probe requested https://ggus.eu/?mode=ticket_info&ticket_id=118901

[2016-01-15] UMD4 released with CentOS7

[2015-12-17] SL5 retirement announced at OMB https://indico.egi.eu/indico/getFile.py/access?contribId=5&resId=0&materialId=slides&confId=2383

[2015-07-13] SAM-nagios is still SL5, no SL6 available. ARGO will get fully centralized, there is no need to use SAM-nagios anymore, so regional instances won't be used anymore.

Services that are not probed by midmon

  • dCache HINT: for installations exposing the web admin interface, you can go to: http://<hostname>:2288/info and check within the XML for the "<os>" tag
  • DPM
  • ARC
  • UNICORE
  • don't forget to upgrade the apel client host if on SL5, we count on it to stop brokers accepting SSLv3

2016-04-18 Overall status

2016-05-09 Overall status


2016-06-13 Overall status

  • from this week on EGI Operations will start suspending sites that host services in production with SL5 and not set under downtime

Reports by NGIs

Please report here which SL5 services are running, at which sites, and the corresponding dismission plan.

AfricaArabia

Only the regional SAM-Nagios box. This will be decommissioned as soon as the central monitoring is reliable.

AsiaPacific

Our NGI Nagios (sam-nagios) server is still SL5 (We tried to build an SL6 one but failed due to lack of sam-nagios SL6 package. I wonder if SL6 sam-nagios is already available now.)

Also, these sites: HK-HKU-CC-01, INDIACMS-TIFR, and IR-IPM-HEP still have SL5 servers.

CERN

No report.

EGI

No report.

IDGF

No report.

NGI_AEGIS

No report.

NGI_ARMGRID

No report.

NGI_BG

No report.

NGI_CH

No report.

NGI_CHINA

No report.

NGI_CZ

No report.

NGI_DE

No report.

NGI_FI

No report.

NGI_FRANCE

No report.

NGI_GE

No report.

NGI_GRNET

No report.

NGI_HR

No report.

NGI_HU

No report.

NGI_IBERGRID

No report.

NGI_IL

No report.

NGI_IT

No report.

NGI_MARGI

No report.

NGI_MD

No report.

NGI_NDGF

No report.

NGI_NL

No report.

NGI_PL

[Apr29] According to our survey, all sites should meet the deadline.

National holidays, and "long weekend" starting 30.04-03.05. In case some sites fail tests on 01.05, please give us a little bit more time till 06.05, before starting suspension procedure :)

For sure, we do not upgrade Nagios boxes (6 instances including vo nagioses and test instances).

NGI_RO

The only SL5 is the one holding ngi.SAM (ngi-ro-nagios.nipne.ro).

NGI_SI

No report.

NGI_SK

No SL5 services.

NGI_TR

Only NGI instances of SAM-nagios are on SL5.

NGI_UA

No report.

NGI_UK

Non-Squid/DPM services:

In progress:

GLASGOW:

- WMS/LB: Being decommissioned following PROC12: tracked at https://ggus.eu/?mode=ticket_info&ticket_id=120973

LANCASTER:

- BDII: SL5 BDII due to be upgraded

OXFORD:

- SAM/VO Nagios SL5 only as with other NGIs (as discussed)

T1:

Castor SRM systems. SRM upgrade waiting on Castor upgrade.

Complete:

BRUNEL:

- BDII: SL5 BDII retired

RHUL:

- CREAM & BDII upgraded

BRISTOL:

- SL5 CREAM decommissioned

ROC_Canada

Only the ROC SAM-Nagios box is still with SL5 This will be decommissioned as soon as the central monitoring is reliable.

ROC_LA

No report.

Russia

[Apr27] the last SL5-powered service in Russian region was reinstalled into CentOS 6 this week, https://goc.egi.eu/portal/index.php?Page_Type=Downtime&id=20542 so ROC_Russia now is free of SL5 machines visible via midmon. There are 3 more disk pools running SL5 over the whole region, we're trying to follow up on their upgrade, though it will take some time.