
Difference between revisions of "Agenda-17-11-2014"

 
(14 intermediate revisions by the same user not shown)
Line 100: Line 100:
** [https://wiki.egi.eu/wiki/MW_Nagios_tests#eu.egi.sec.dCache-2.2 SAM probe, eu.egi.sec.dCache-2.2, raising alarms]
** [http://bit.ly/1nWTYPW SAM results]:
*** '''NGI_DE''' - FZJ, DESY-ZN, GoeGrid, UNI-FREIBURG,
*** '''NGI_DE''' - GoeGrid, UNI-FREIBURG:
*** '''NGI_NL''' - BEgrid-ULB-VUB
*** '''ROC_Canada''' - CA-TRIUMF-T2K
*** '''ROC_Russia''' - JINR-T1


* Please provide timeline for update
<code>
 
    1. NGI_DE/GoeGrid
    ggus: https://ggus.eu/index.php?mode=ticket_info&ticket_id=109904 - in progress (admins need more time - till the end of 2014)
    gocdb: https://goc.egi.eu/portal/index.php?Page_Type=Service&id=3072 - no service downtime
    nagios: https://midmon.egi.eu/nagios/cgi-bin/status.cgi?host=se-goegrid.gwdg.de - still critical
 
    2. NGI_DE/UNI-FREIBURG
    ggus: https://ggus.eu/index.php?mode=ticket_info&ticket_id=109905 - in progress (admins need more time - till December)
    gocdb: https://goc.egi.eu/portal/index.php?Page_Type=Service&id=3071 - no service downtime
    nagios: https://midmon.egi.eu/nagios/cgi-bin/status.cgi?host=se.bfg.uni-freiburg.de - still critical
 
</code>


== 2.4 Configuration of the new VOMS server for OPS in the infrastructure AND SAM  ==


* [http://bit.ly/sites_new_VOMS_ops GGUS: Sites need to configure the new VOMS server for ops and LHC VOs]:
** '''NGI_AfricaArabia, NGI_IL, NGI_RO'''
** '''NGI_IL, NGI_RO'''
* The NGIs' SAM Nagios instances also have to be configured - a notification with instructions will be sent


== 2.5 Resource Centres performance ==
== 2.5 SAM Nagios probes re-factoring ==
* [https://wiki.egi.eu/wiki/Performance RCs performance twiki]
* [https://documents.egi.eu/public/RetrieveFile?docid=2305&version=2&filename=EGI_Sep2014.pdf  Availability/Reliability Reports for Sept 2014]
** [https://wiki.egi.eu/wiki/Underperforming_sites_and_suspensions  Sept Underperforming_sites_and_suspensions] - to be updated with tickets opened
*** [http://bit.ly/sept14_ava_rel GGUS - 13 sites / 7 NGIs]: '''NGI_DE, NGI_IBERGRID, NGI_IT, NGI_NL, ROC_Asia/Pacific, ROC_LA, ROC_Russia'''


== 2.6 SAM Nagios probes re-factoring  ==
* SAM Update 23
** Staged-Rollout to start this week (tomorrow at the latest) - staged-rollout repository in preparation
*** Staged-Rollout volunteers:
**** NGI_NDGF (Petter Urkedal)
**** NGI_FI (Ulf Tigerstedt)
**** NGI_UK (Kashif Mohammad)
**** NGI_IBERGRID (Esteban Freire)
** Documentation to be followed - [https://wiki.egi.eu/wiki/SAMUpdate23 SAM Update 23 wiki]
** Major changes in SAM Update-23:
*** Probes are moved to the UMD-3 repository. This decision was approved by the OMB in order to enable probe developers to update probes more frequently and independently from SAM releases.
*** Removal of the SAM GridMon (sam-gridmon) and its dependencies. SAM Update-23 supports only SAM Nagios (sam-nagios). In a future version, SAM GridMon will be replaced with the ARGO engine.
*** Detailed list of all tickets can be found here: [https://github.com/ARGOeu/sam-probes/issues?q=is%3Aissue+milestone%3AUpdate-23].


* changes in the enabled probes - will be done by today COB
== 2.6 MySQL 5.0 EOL  ==
** removed FTS2 from ROC and ROC_Operators profiles
*** ch.cern.FTS-ChannelList
*** ch.cern.FTS-InfoSites
** move GlueValidator to Operations profiles (ROC, ROC_Operators)
*** remove org.gstat.SanityCheck
* [https://wiki.egi.eu/wiki/SAM Documentation] under improvement
* SAM Update 23 in preparation - a testing instance has been set up by the SAM/ARGO team
 
== 2.7 MySQL 5.0 EOL  ==


* discussed during [https://wiki.egi.eu/wiki/URT:Agenda-29-09-2014#SL5_.26_MySQL_5.0_vs._MySQL_5.X.2C_x.3E.3D1 URT meeting, 29.09.2014]
Line 146: Line 152:
** DPM - [https://github.com/puppetlabs/puppetlabs-mysql/blob/master/manifests/params.pp puppetlabs/mysql dependency on "mysql-server"],
** STORM - storm-backend-server - confirmed it should work with MySQL 5.1, as on SL6, but not tested
** L&B - glite-lb-server - under investigation
** '''L&B - glite-lb-server - under investigation - will be checked by the SoftwareProvisioning team on the EGI verification testbed'''
** WMS - emi-wms - under investigation
** '''WMS - emi-wms - under investigation - will be checked by the SoftwareProvisioning team on the EGI verification testbed'''


* Recommendation - site-admins must be made aware to '''avoid using MySQL v. 5.0''', where possible.


== 2.8 SL/SLC/CentOS 5 Support Lifetime  ==
== 2.7 SL/SLC/CentOS 5 Support Lifetime  ==
* [https://www.scientificlinux.org/ Scientific Linux Homepage]
* [http://linux.web.cern.ch/linux/scientific5/ SLC5]
Line 161: Line 167:
= 3. AOB  =


== 3.2 Next meetings ==
== 3.1 Oct Availability/Reliability ==
 
* [https://ggus.eu/?mode=ticket_info&ticket_id=110090 GGUS #110090] - 34 tickets still open
 
== 3.2 Work on a new Broadcast-usage procedure  ==
 
== 3.3 Next meetings ==


* '''Nov. 17, 2014''' (OMB on Oct. 30)
* '''Dec. 15, 2014''' (OMB on Nov. 27)


= 4. Minutes  =


* [https://indico.egi.eu/indico/materialDisplay.py?materialId=minutes&confId=2330 Minutes previous meeting - 20.10.2014]
* [https://TOADD Minutes 17.11.2014]
* [https://indico.egi.eu/indico/materialDisplay.py?materialId=minutes&confId=2359 Minutes 17.11.2014]


[[Category:Grid_Operations_Meetings]]

Latest revision as of 16:03, 18 November 2014

Audio conference link: the conference system is Adobe Connect, no password required.
Audio conference details: Indico page



1. Middleware releases and staged rollout

1.1 News from URT

Recent, or future planned, releases from the product teams:

1.2 UMD release

UMD 3.9.0 released on 10.11.2014 : http://repository.egi.eu/2014/11/10/release-umd-3-9-0/

1.3 Staged rollout updates

  • cream-torque v. 2.1.4
  • cream v. 1.16.4
  • glexec-wn v. 1.3.0

In Verification

  • cream-ge v. 2.2.0
  • qcg-ntf v. 3.4.0 (SL6)
  • qcg-comp v. 3.4.0 (SL6)


New Products

  • squid v. 2.7.19
  • fts3 v. 3.2.27

Ready to be released:

....

UMD 3 EA

  • Some sites have outdated EA contact points, so please check in the table that all contacts and products are still correct, and send me an email if you need to add or remove contacts (SSO account mandatory): (full site list)

New Products

FTS3 and SQUID are to be included in UMD, and it is important to have some early adopters for these components. If anyone is interested, please contact me or Cristina to be included in the early adopter list.

1.4 Next releases

  • Mid-December

2. Operational issues

2.1 Report from DMSU

ARGUS/WMS Certificate Chain Mixups

  • Affecting several sites, where WMS is unable to make SSL connection to ARGUS.
  • Most probably this is a combination of two factors: curl from the SL6 distribution is built with NSS rather than OpenSSL and, as such, does not really support proxy certificates; and a bug in Java, hopefully fixed as of Java 7 Update 60.
  • Related issues:
  • This issue is already being investigated at 3rd level, but the PTs cannot agree on who is responsible, and DMSU is overseeing the investigation.
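
As a rough aid for site admins, the SSL library a given curl binary was built against can be read from the first line of `curl --version`, which lists libcurl's components (for example `NSS/...` or `OpenSSL/...`). The sketch below only parses that line; the helper names and the sample string are illustrative assumptions, not output captured from an affected site, and an NSS build is only one of the two suspected factors.

```python
import re


def ssl_backends(version_output):
    """Return SSL library names found on the first line of `curl --version` output."""
    first_line = version_output.splitlines()[0] if version_output else ""
    # Components are listed as name/version pairs, e.g. "NSS/3.14.0.0".
    return re.findall(r"\b(NSS|OpenSSL|GnuTLS)/", first_line)


def uses_nss(version_output):
    """True if the curl build reports NSS as an SSL backend."""
    return "NSS" in ssl_backends(version_output)


# Illustrative sample only (hypothetical version numbers).
SAMPLE = (
    "curl 7.19.7 (x86_64-redhat-linux-gnu) libcurl/7.19.7 "
    "NSS/3.14.0.0 zlib/1.2.3 libidn/1.18 libssh2/1.4.2\n"
    "Protocols: dict file ftp ftps http https"
)

if __name__ == "__main__":
    print("NSS-based curl" if uses_nss(SAMPLE) else "not NSS-based")
```

On a real host, the input could be obtained with `subprocess.run(["curl", "--version"], capture_output=True, text=True).stdout` and passed to `uses_nss`.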

2.2 EMI-2 decommissioning

2.3 dCache 2.2.X decommissioning

   1. NGI_DE/GoeGrid
   ggus: https://ggus.eu/index.php?mode=ticket_info&ticket_id=109904 - in progress (admins need more time - till the end of 2014)
   gocdb: https://goc.egi.eu/portal/index.php?Page_Type=Service&id=3072 - no service downtime
   nagios: https://midmon.egi.eu/nagios/cgi-bin/status.cgi?host=se-goegrid.gwdg.de - still critical 
   2. NGI_DE/UNI-FREIBURG 
   ggus: https://ggus.eu/index.php?mode=ticket_info&ticket_id=109905 - in progress (admins need more time - till December)
   gocdb: https://goc.egi.eu/portal/index.php?Page_Type=Service&id=3071 - no service downtime
   nagios: https://midmon.egi.eu/nagios/cgi-bin/status.cgi?host=se.bfg.uni-freiburg.de - still critical

2.4 Configuration of the new VOMS server for OPS in the infrastructure AND SAM

2.5 SAM Nagios probes re-factoring

  • SAM Update 23
    • Staged-Rollout to start this week (tomorrow at the latest) - staged-rollout repository in preparation
      • Staged-Rollout volunteers:
        • NGI_NDGF (Petter Urkedal)
        • NGI_FI (Ulf Tigerstedt)
        • NGI_UK (Kashif Mohammad)
        • NGI_IBERGRID (Esteban Freire)
    • Documentation to be followed - SAM Update 23 wiki
    • Major changes in SAM Update-23:
      • Probes are moved to the UMD-3 repository. This decision was approved by the OMB in order to enable probe developers to update probes more frequently and independently from SAM releases.
      • Removal of the SAM GridMon (sam-gridmon) and its dependencies. SAM Update-23 supports only SAM Nagios (sam-nagios). In a future version, SAM GridMon will be replaced with the ARGO engine.
      • Detailed list of all tickets can be found here: [1].

2.6 MySQL 5.0 EOL

  • discussed during URT meeting, 29.09.2014
  • MySQL versions available:
  • SL5:
    • mysql-*5.0.95* -> MySQL 5.0
    • mysql51-*5.1.70* -> MySQL 5.1
    • mysql55-*5.5.32* -> MySQL 5.5
  • SL6:
    • mysql-*5.1.71 -> MySQL 5.1
  • Middleware dependency on 'mysql-server':
    • VOMS - emi-voms-mysql - confirmed it should work with MySQL 5.1, as on SL6, but not tested
    • CREAM - emi-cream-ce - GGUS #106250
    • DPM - puppetlabs/mysql dependency on "mysql-server",
    • STORM - storm-backend-server - confirmed it should work with MySQL 5.1, as on SL6, but not tested
    • L&B - glite-lb-server - under investigation - will be checked by the SoftwareProvisioning team on the EGI verification testbed
    • WMS - emi-wms - under investigation - will be checked by the SoftwareProvisioning team on the EGI verification testbed
  • Recommendation - site-admins must be made aware to avoid using MySQL v. 5.0, where possible.
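
The package-to-series mapping listed above can be checked mechanically. The following is a minimal sketch, assuming a site has already collected the installed MySQL version string by its own means; the function names and the labels in `VERSIONS` are hypothetical, and only the version numbers come from the list above.

```python
import re

# Series that the recommendation above says to avoid where possible.
EOL_SERIES = {"5.0"}


def mysql_series(version):
    """Reduce a full version such as '5.0.95' to its 'major.minor' series."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised MySQL version: {version!r}")
    return f"{match.group(1)}.{match.group(2)}"


def should_avoid(version):
    """True if the version belongs to a series flagged as end-of-life."""
    return mysql_series(version) in EOL_SERIES


# Versions taken from the SL5/SL6 list above; the keys are illustrative labels.
VERSIONS = {
    "SL5 mysql": "5.0.95",
    "SL5 mysql51": "5.1.70",
    "SL5 mysql55": "5.5.32",
    "SL6 mysql": "5.1.71",
}

flagged = sorted(name for name, ver in VERSIONS.items() if should_avoid(ver))

if __name__ == "__main__":
    for name in flagged:
        print(f"{name}: MySQL {mysql_series(VERSIONS[name])} should be avoided")
```

Only the default SL5 `mysql` package falls into the flagged series here, which matches the recommendation to move to MySQL 5.1 or later where the middleware allows it.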

2.7 SL/SLC/CentOS 5 Support Lifetime

3. AOB

3.1 Oct Availability/Reliability

3.2 Work on a new Broadcast-usage procedure

3.3 Next meetings

  • Dec. 15, 2014 (OMB on Nov. 27)

4. Minutes