
Difference between revisions of "Agenda-20-10-2014"

 
(34 intermediate revisions by 3 users not shown)
Line 40: Line 40:


== 1.3 Staged rollout updates  ==
== 1.3 Staged rollout updates  ==


* ARC v. 4.2.0 (13.11.2)
* ARC v. 4.2.0 (13.11.2)
* bdii-core 1.6.0 & Glue-validator 2.0.24
* cream-slurm v. 1.0.2 ('''new''')
* qcg-broker-3.4.0
* emi-wn v. 3.1.0 ('''new''')
* qcg-broker-client-3.4.0
* qcg-broker-egi-is-provider-3.4.0
* qcg-egi-is-conf-3.4.0


=== In Verification ===
=== In Verification ===
* canl v. 2.2.3
* canl v. 2.2.3
* cream-ge v. 2.2.0
* cream-ge v. 2.2.0
* cream-lsf v. 2.0.4
* cream-slurm v. 1.0.2
* cream-torque v. 2.1.4
* cream-torque v. 2.1.4
* dcache v. 2.6.31
* dcache v. 2.6.35
* dcache-srm-client v. 2.6.12
* dcache-srmclient v. 2.2.27
* cgsi-gsoap v. 1.3.6
* cgsi-gsoap v. 1.3.6
* emi-wn v. 3.1.0
* emi-ui v. 3.1.0
* emi-ui v. 3.1.0
* srm-ifce v. 1.20.1
* srm-ifce v. 1.20.1
* bdii-core v. 1.6.0
* wms v. 3.6.6
* wms v. 3.6.6
* apel v. 1.2.2
* apel v. 1.2.2
Line 78: Line 72:
* qcg-computing-3.4.0  
* qcg-computing-3.4.0  
* qcg-notifications-3.4.0
* qcg-notifications-3.4.0
* qcg-broker-3.4.0
* qcg-broker-client-3.4.0
* qcg-broker-egi-is-provider-3.4.0
* qcg-egi-is-conf-3.4.0
* cream-lsf v. 2.0.4
* bdii-core 1.6.0 & Glue-validator 2.0.25
* mpi v. 1.5.3


=== UMD 3 EA ===
=== UMD 3 EA ===
Line 101: Line 102:
** https://ggus.eu/index.php?mode=ticket_info&ticket_id=101486
** https://ggus.eu/index.php?mode=ticket_info&ticket_id=101486
* This issue is already being investigated at '''3rd level''', but the PTs cannot decide who is responsible, and DMSU is overseeing.
* This issue is already being investigated at '''3rd level''', but the PTs cannot decide who is responsible, and DMSU is overseeing.
=== CREAM CLI/GridSite SegFaults at Long-Lived Proxies ===
* <code>glite-ce-job-submit</code> crashes if the user's proxy certificate has a lifetime exceeding 240 hours (10 days)
* 2 issues:
** one in GridSite for the segfault, forwarded to the GridSite PT to fix - '''UPDATE''' - fix provided with [https://github.com/CESNET/gridsite/wiki/Gridsite-release-page#GridSite_2251 GridSite v. 2.2.5 (issue #15)]
** one in caNL-c - '''UPDATE''' - fix provided with [https://github.com/CESNET/canl-c/wiki/caNl-c-Release-Page#caNlc_2151 caNL-c v. 2.1.5, issue #6]
* Related issue:
** https://ggus.eu/?mode=ticket_info&ticket_id=104009
** https://ggus.eu/?mode=ticket_info&ticket_id=105893
=== L&B & CREAM Update Issues - on EMI repositories ===
* '''UPDATE'''
** EMI 3 Update 18 solves some of the issues introduced with the previous update:
*** [https://github.com/CESNET/glite-lb/wiki/Glite-l%26b-release-page#LB_4121 L&B v. 4.1.2]


== 2.2 EMI-2 decommissioning  ==
== 2.2 EMI-2 decommissioning  ==


* UMD2 APEL clients - [bit.ly/apel_clients_umd2 - GGUS "Sites using UMD2 APEL clients"] - '''10 NGI not updated''':
* UMD2 APEL clients - [http://bit.ly/apel_clients_umd2 GGUS: Sites using UMD2 APEL clients] - '''9 NGIs have not finished the update''':
** NGI_UA, ROC_LA, ROC_Canada, NGI_CYGRID, NGI_HR, NGI_DE, ROC_Asia/Pacific, ROC_Russia, NGI_MARGI
** NGI_UA, ROC_LA, ROC_Canada, NGI_CYGRID, NGI_HR, NGI_DE, ROC_Asia/Pacific, ROC_Russia, NGI_MARGI


Line 136: Line 123:
** [https://wiki.egi.eu/wiki/MW_Nagios_tests#eu.egi.sec.dCache-2.2 SAM probe, eu.egi.sec.dCache-2.2, raising alarms]
** [https://wiki.egi.eu/wiki/MW_Nagios_tests#eu.egi.sec.dCache-2.2 SAM probe, eu.egi.sec.dCache-2.2, raising alarms]
** [http://bit.ly/1nWTYPW SAM results]:
** [http://bit.ly/1nWTYPW SAM results]:
*** NGI_DE - FZJ, DESY-ZN, GoeGrid, UNI-FREIBURG,  
*** '''NGI_DE''' - FZJ, DESY-ZN, GoeGrid, UNI-FREIBURG,  
*** NGI_NL - BEgrid-ULB-VUB
*** '''NGI_NL''' - BEgrid-ULB-VUB
*** ROC_Canada - CA-TRIUMF-T2K
*** '''ROC_Canada''' - CA-TRIUMF-T2K
*** '''ROC_Russia''' - JINR-T1


* Please provide timeline for update
* Please provide timeline for update


== 2.4 MySQL 5.0 EOL  ==
== 2.4 Configuration of the new VOMS server for OPS in the infrastructure AND SAM  ==
 
* [http://bit.ly/sites_new_VOMS_ops GGUS: Sites need to configure the new VOMS server for ops and LHC VOs]:
** '''NGI_AfricaArabia, NGI_IL, NGI_RO'''
* NGIs' SAM Nagios instances also have to be configured - a notification with instructions will be sent
 
== 2.5 Resource Centres performance  ==
* [https://wiki.egi.eu/wiki/Performance RCs performance twiki]
* [https://documents.egi.eu/public/RetrieveFile?docid=2305&version=2&filename=EGI_Sep2014.pdf  Availability/Reliability Reports for Sept 2014]
** [https://wiki.egi.eu/wiki/Underperforming_sites_and_suspensions  Sept Underperforming_sites_and_suspensions] - to be updated with tickets opened
*** [http://bit.ly/sept14_ava_rel GGUS - 13 sites / 7 NGIs]: '''NGI_DE, NGI_IBERGRID, NGI_IT, NGI_NL, ROC_Asia/Pacific, ROC_LA, ROC_Russia'''
 
== 2.6 SAM Nagios probes re-factoring  ==
 
* changes in the enabled probes - will be done by today COB
** removed FTS2 from ROC and ROC_Operators profiles
*** ch.cern.FTS-ChannelList
*** ch.cern.FTS-InfoSites
** move GlueValidator to Operations profiles (ROC, ROC_Operators)
*** remove org.gstat.SanityCheck
* [https://wiki.egi.eu/wiki/SAM Documentation] under improvement
* SAM Update 23 in preparation - a testing instance has been set up by the SAM/ARGO team
 
== 2.7 MySQL 5.0 EOL  ==


* discussed during [https://wiki.egi.eu/wiki/URT:Agenda-29-09-2014#SL5_.26_MySQL_5.0_vs._MySQL_5.X.2C_x.3E.3D1 URT meeting, 29.09.2014]
* discussed during [https://wiki.egi.eu/wiki/URT:Agenda-29-09-2014#SL5_.26_MySQL_5.0_vs._MySQL_5.X.2C_x.3E.3D1 URT meeting, 29.09.2014]
Line 154: Line 165:


* Middleware dependency on ''''''mysql-server'''''':
* Middleware dependency on ''''''mysql-server'''''':
** VOMS - emi-voms-mysql
** VOMS - emi-voms-mysql - confirmed it should work with MySQL 5.1, as on SL6, but not tested
*** PT confirms no pb on SL5 using v.5.1, as on SL6
** CREAM - emi-cream-ce - [https://ggus.eu/?mode=ticket_info&ticket_id=106250 GGUS #106250]
** CREAM - emi-cream-ce
** DPM - [https://github.com/puppetlabs/puppetlabs-mysql/blob/master/manifests/params.pp puppetlabs/mysql dependency on "mysql-server"], 
*** configuration only works for v. 5.0 ([https://ggus.eu/?mode=ticket_info&ticket_id=106250 GGUS #106250]), PT is planning an update
** STORM - storm-backend-server - confirmed it should work with MySQL 5.1, as on SL6, but not tested
** DPM - no clear dependency on "mysql-server"
** L&B - glite-lb-server - under investigation
*** PT confirms no pb on SL5 using v.5.1, as on SL6
** WMS - emi-wms - under investigation
** STORM - storm-backend-server
*** PT confirms no pb on SL5 using v.5.1, as on SL6
** L&B - glite-lb-server, under investigation
** WMS - emi-wms, under investigation
 


* Recommendation - site-admins must be made aware to '''avoid using MySQL v. 5.0''', where possible.
* Recommendation - site-admins must be made aware to '''avoid using MySQL v. 5.0''', where possible.


== 2.5 classads "retired" from EPEL repos ==
== 2.8 SL/SLC/CentOS 5 Support Lifetime  ==
 
* during an EPEL "cleaning" process the classads-* packages were retired from all EPEL & Fedora repositories
** problems installing new services like  CREAM, UI, WN, WMS, L&B
 
* Short term-solution
** classads-* packages were added to the EMI-3 third-party repositories and will soon be added to the UMD3 repos
* Long-term solution
** the process to un-retire classads-* in EPEL was started
 
* Remaining - '''XERCES-c''', emi-WN - still to be investigated, with the "short-term solution" to be applied
 
= 3. AOB  =
== 3.1 SL/SLC/CentOS 5 Support Lifetime  ==
* [https://www.scientificlinux.org/ Scientific Linux Homepage]
* [https://www.scientificlinux.org/ Scientific Linux Homepage]
* [http://linux.web.cern.ch/linux/scientific5/ SLC5]
* [http://linux.web.cern.ch/linux/scientific5/ SLC5]
Line 188: Line 181:
** January 31, 2014 - End of Production 2 phase
** January 31, 2014 - End of Production 2 phase
*** During the Production 3 Phase, Critical impact Security Advisories (RHSAs) and selected Urgent Priority Bug Fix Advisories (RHBAs) may be released as they become available.
*** During the Production 3 Phase, Critical impact Security Advisories (RHSAs) and selected Urgent Priority Bug Fix Advisories (RHBAs) may be released as they become available.
= 3. AOB  =


== 3.2 Next meetings  ==
== 3.2 Next meetings  ==


* '''Oct. 20, 2014'''
* '''Nov. 17, 2014''' (OMB on Oct. 30)
* Nov. 17, 2014 (OMB on Oct. 30)


= 4. Minutes  =
= 4. Minutes  =


* [https://indico.egi.eu/indico/materialDisplay.py?materialId=minutes&confId=2310 Minutes previous meeting - 08.09.2014]
* [https://indico.egi.eu/indico/materialDisplay.py?materialId=minutes&confId=2322 Minutes previous meeting - 06.10.2014]
* [https://indico.egi.eu/indico/conferenceDisplay.py?confId=2322 Minutes 06.10.2014]
* [https://indico.egi.eu/indico/materialDisplay.py?materialId=minutes&confId=2330 Minutes 20.10.2014]


[[Category:Operations]]
[[Category:Grid_Operations_Meetings]]

Latest revision as of 17:31, 23 October 2014

Audio conference link: Conference system is Adobe Connect, no password required.
Audio conference details: Indico page



1. Middleware releases and staged rollout

1.1 News from URT

Recent or planned releases from the product teams:

1.2 UMD release

UMD 3.8.1 released on 09.10.2014: http://repository.egi.eu/2014/10/09/release-umd-3-8-1/

1.3 Staged rollout updates

  • ARC v. 4.2.0 (13.11.2)
  • cream-slurm v. 1.0.2 (new)
  • emi-wn v. 3.1.0 (new)

In Verification

  • canl v. 2.2.3
  • cream-ge v. 2.2.0
  • cream-torque v. 2.1.4
  • dcache v. 2.6.35
  • dcache-srmclient v. 2.2.27
  • cgsi-gsoap v. 1.3.6
  • emi-ui v. 3.1.0
  • srm-ifce v. 1.20.1
  • wms v. 3.6.6
  • apel v. 1.2.2

New Products

  • cvmfs v. 2.1.19
  • squid v. 2.7.19
  • fts3 v. 3.2.27
  • gfal2 v. 2.6.8
  • gfal2-python v. 1.5.0
  • gfalfs v. 1.5.0
  • gfal2-utils v. 1.0.0
  • davix v. 0.3.6

Ready to be released:

  • qcg-computing-3.4.0
  • qcg-notifications-3.4.0
  • qcg-broker-3.4.0
  • qcg-broker-client-3.4.0
  • qcg-broker-egi-is-provider-3.4.0
  • qcg-egi-is-conf-3.4.0
  • cream-lsf v. 2.0.4
  • bdii-core 1.6.0 & Glue-validator 2.0.25
  • mpi v. 1.5.3

UMD 3 EA

  • Some sites have outdated contact points for their EA adopters, so please check in the table that all contacts and products are still correct, and send me an email if you need to add or remove contacts (SSO account mandatory): (full site list)

New Products

FTS3, SQUID and CVMFS will soon be included in UMD, and it is important to have some early adopters for these components. If anyone is interested, please contact me or Cristina to be included in the early adopter list.

1.4 Next releases

  • End of October
  • Mid-December

2. Operational issues

2.1 Report from DMSU

ARGUS/WMS Certificate Chain Mixups

  • Affecting several sites, where WMS is unable to make SSL connection to ARGUS.
  • In all probability this is a combination of two things: the use of curl from the SL6 distribution, which is built with NSS SSL rather than OpenSSL and, as such, does not really support proxy certificates; and a bug in Java, hopefully fixed since Java 7 Update 60.
  • Related issues:
  • This issue is already being investigated at 3rd level, but the PTs cannot decide who is responsible, and DMSU is overseeing.

2.2 EMI-2 decommissioning

  • UMD2 APEL clients - GGUS: Sites using UMD2 APEL clients - 9 NGIs have not finished the update:
    • NGI_UA, ROC_LA, ROC_Canada, NGI_CYGRID, NGI_HR, NGI_DE, ROC_Asia/Pacific, ROC_Russia, NGI_MARGI
  • Following up with COD - GGUS #106354
  • NGIs with UMD2/EMI2 services - AsiaPacific - issue escalated to EGI Operations
  • ALL SITES PROVIDING UMD 2/EMI 2 services MUST BE IN DOWNTIME or SUSPENDED

2.3 dCache 2.2.X decommissioning

  • Please provide timeline for update

2.4 Configuration of the new VOMS server for OPS in the infrastructure AND SAM

2.5 Resource Centres performance

2.6 SAM Nagios probes re-factoring

  • changes in the enabled probes - will be done by today COB
    • removed FTS2 from ROC and ROC_Operators profiles
      • ch.cern.FTS-ChannelList
      • ch.cern.FTS-InfoSites
    • move GlueValidator to Operations profiles (ROC, ROC_Operators)
      • remove org.gstat.SanityCheck
  • Documentation under improvement
  • SAM Update 23 in preparation - a testing instance has been set up by the SAM/ARGO team

2.7 MySQL 5.0 EOL

  • discussed during URT meeting, 29.09.2014
  • MySQL versions available:
  • SL5:
    • mysql-*5.0.95* -> MySQL 5.0
    • mysql51-*5.1.70* -> MySQL 5.1
    • mysql55-*5.5.32* -> MySQL 5.5
  • SL6:
    • mysql-*5.1.71 -> MySQL 5.1
  • Middleware dependency on 'mysql-server':
    • VOMS - emi-voms-mysql - confirmed it should work with MySQL 5.1, as on SL6, but not tested
    • CREAM - emi-cream-ce - GGUS #106250
    • DPM - puppetlabs/mysql dependency on "mysql-server",
    • STORM - storm-backend-server - confirmed it should work with MySQL 5.1, as on SL6, but not tested
    • L&B - glite-lb-server - under investigation
    • WMS - emi-wms - under investigation
  • Recommendation - site admins must be made aware that they should avoid using MySQL v. 5.0 where possible.
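
A minimal sketch of how a site admin might check a MySQL version string against the recommendation above. The helper names (`mysql_series`, `is_eol`) and the `EOL_SERIES` set are hypothetical, introduced only for illustration; the sample version strings are the ones listed in this section. How the version string is obtained on a given host is left out, since that depends on the site setup.

```python
import re
from typing import Tuple

# Assumption: only the 5.0 series is treated as end-of-life,
# following the recommendation in this agenda item.
EOL_SERIES = {(5, 0)}


def mysql_series(version: str) -> Tuple[int, int]:
    """Return (major, minor) parsed from a version string such as '5.0.95'."""
    match = re.match(r"(\d+)\.(\d+)", version.strip())
    if match is None:
        raise ValueError("unrecognised MySQL version: %r" % version)
    return int(match.group(1)), int(match.group(2))


def is_eol(version: str) -> bool:
    """True if the version belongs to a series listed in EOL_SERIES."""
    return mysql_series(version) in EOL_SERIES


if __name__ == "__main__":
    # Versions listed above for SL5 and SL6.
    for v in ("5.0.95", "5.1.70", "5.5.32", "5.1.71"):
        status = "EOL, avoid where possible" if is_eol(v) else "not flagged"
        print("%s: %s" % (v, status))
```

Only the major and minor numbers are compared, so any 5.0.x patch level is flagged in the same way.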

2.8 SL/SLC/CentOS 5 Support Lifetime

3. AOB

3.2 Next meetings

  • Nov. 17, 2014 (OMB on Oct. 30)

4. Minutes