
Difference between revisions of "Agenda-05-05-2014"

(Created page with "{| |- | [http://connect.ct.infn.it/egi-inspire-sa1/ Audio conference link] | ''Conference system is Adobe Connect, no password required.'' |- | [https://indico.egi.eu/indico/mat...")
 
 
(14 intermediate revisions by 2 users not shown)
Line 41: Line 41:
== 1.2 UMD release  ==
== 1.2 UMD release  ==


* On 2nd of April it was released an updated of the EGI Trust Anchor release 1.56-1.
* No news
 
Released [https://wiki.egi.eu/wiki/UMD-3:UMD-3.6.0 UMD 3.6.0]:
 
* '''Updates''':
** wms v. 3.6.3
** cream_torque v. 2.1.3
** dpm-yaim v. 1.8.7
** gridsite v. 2.2.2
** glexec-wn v. 1.2.2
 
* '''New in UMD''':
** gfal2 v. 2.4.8
** slurm-wn v. 1.0.0
** cream-slurm v. 1.0.1
** gridsafe v. 1.3.1


== 1.3 Staged rollout updates  ==
== 1.3 Staged rollout updates  ==
Line 63: Line 48:


* wms v. 3.6.4
* wms v. 3.6.4
* gridway v. 5.14.2
* globus-info-provider-service v. 0.2.1


old stuff:
old stuff:
Line 75: Line 62:
* Contacting Sites ([https://www.egi.eu/earlyAdopters/table full site list]) in order to update the products list now for the UMD-3 products. '''Some sites have the contact points for the EA adopters outdated'''  so please check in table if all contacts are still correct and send me email if you need to add / remove some contacts (SSO account mandatory)
* Contacting Sites ([https://www.egi.eu/earlyAdopters/table full site list]) in order to update the products list now for the UMD-3 products. '''Some sites have the contact points for the EA adopters outdated'''  so please check in table if all contacts are still correct and send me email if you need to add / remove some contacts (SSO account mandatory)


* Sites who didn't replied and still tagged as EA for UMD1 / UMD2 products '''20 of the total 80 teams''' (updated Friday morning):
* Sites who didn't replied and still tagged as EA for UMD1 / UMD2 products (updated Monday morning):
** NGI_BG, BG01-IPP
** NGI_BG, BG01-IPP
** NGI_CZ, prague_cesnet_lcg2
** NGI_DE, FZK-LCG2
** NGI_DE, FZK-LCG2
** NGI-DK, UNICPH-NBI
** NGI-DK, UNICPH-NBI
Line 84: Line 70:
** NGI_HR, egee.srce.hr
** NGI_HR, egee.srce.hr
** NGI_IBERGRID, UPV-GRyCAP
** NGI_IBERGRID, UPV-GRyCAP
** NGI_MARGI, MK-01-UKIM_II
** NGI_NL, SARA
** NGI_NL, SARA
** NGI_SE, HPC2N  
** NGI_SE, HPC2N  
Line 90: Line 75:
** NGI_UK, UKI-LT2-RHUL
** NGI_UK, UKI-LT2-RHUL
** NGI_UK, UKI-NORTHGRID-LANCS-HEP  
** NGI_UK, UKI-NORTHGRID-LANCS-HEP  
** NGI_HR, egee.srce.hr
** NGI_UK, UKI-NORTHGRID-MAN-HEP
** NGI_UK, UKI-NORTHGRID-MAN-HEP
** NGI_UK, UKI-SCOTGRID-ECDF
** NGI_UK, UKI-SCOTGRID-ECDF
Line 96: Line 80:
** NGI_UK, UKI-SOUTHGRID-OX-HEP
** NGI_UK, UKI-SOUTHGRID-OX-HEP
** ROC_canada, CA-McGill-CLUMEQ-T2
** ROC_canada, CA-McGill-CLUMEQ-T2
** Mixed cases:
** NGI_UK, RAL-LCG2 (already participating in many UMD-3 products. Only some components need to be update: MyPROXY, APEL FTS)


== 1.4 Next releases  ==
== 1.4 Next releases  ==
Line 108: Line 89:


== 2.1 Report from DMSU  ==
== 2.1 Report from DMSU  ==
Nothing to report.
* from Alessandro Paolini:
** The nagios probe '''eu.egi.sec.DPM-GLUE2-EMI-1''' should be modified because it tries to detect some information that the new version of DPM doesn't publish any more
*** references: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=104943 GGUS #104943],  [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105143 GGUS #105143]
 
== 2.2 Migration of Central SAM services & reconfiguration of NGIs SAM instances  ==
 
*Central SAM services are in the process of migrating from CERN to the new consortium (GRNET, CNRS and SRCE). In order to enable smooth transition we have agreed to start using new hostnames:
** '''mon.egi.eu for grid-monitoring.cern.ch'''
** '''opsmon.egi.eu for ops-monitor.cern.ch'''
*CERN services will be operational until May 1st. Afterwards aliases will point to new instances.
 
*If '''Regional & VO SAM instances''' are not re-configurred '''*it will stop working*''' after the switch off of the CERN instance.
 
*The following instances are not yet configured, and tkts have been opened to follow them up:
** cygrid-nagios.grid.ucy.ac.cy - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105123 NGI_CYGRID #105123]
** grid-nagios.ii.edu.mk - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105124 NGI_MARGI #105124]
** mon-ua.bitp.kiev.ua - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105125 NGI_UA #105125]
** nagios.egee.cesnet.cz - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105126 NGI_CZ #105126]
** nagios.ipp.acad.bg - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105127 NGI_BG #105127]
** ngi-de-nagios.gridka.de - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105128 NGI_DE #105128]
** node02-02.imi.renam.md - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105129 NGI_MD #105129]
** rnag1.grid.kiae.ru - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105130 ROC_Russia #105130]
** rocnagios.grid.sinica.edu.tw - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105131 ROC_Asia/Pacific #105131]
** sam.grid.am - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105132 NGI_ARMGRID #105132]
** wipp-srs.weizmann.ac.il - [https://ggus.eu/index.php?mode=ticket_info&ticket_id=105133 NGI_IL #105133]
 


== 2.2 New Nagios Probes ==
<ul>
* ...
<li> Configuration advices:</li>
<ol><li> NGI and VO SAM Nagios instances</li>
* Create file /etc/voms2htpasswd-static.d/opsmon.conf with the following content:
/C=HR/O=edu/OU=srce/CN=opsmon.egi.eu
<li> NGI SAM Nagios instances</li>
* Set the following two variables in YAIM:
ATP_ROOT_URL="http://mon.egi.eu/atp"
  POEM_SYNC_URLS="http://mon.egi.eu/poem/api/0.1/json/"
<li> Rerun YAIM</li>
/opt/glite/yaim/bin/yaim -c -s site-info.def -n NAGIOS -n SAM_NAGIOS
</ol>
If you prefer '''not to run YAIM''' skip the steps 2 & 3 and perform the following:
<ol style="list-style-type:lower-latin"><li> NGI and VO SAM Nagios instances</li>
* Restart service voms-htpasswd:
service voms-htpasswd restart
<li> NGI SAM Nagios instances</li>
* Modify parameter POEM_SYNC_NS_URLS in file /etc/poem/poem_sync.ini:
POEM_SYNC_NS_URLS: http://mon.egi.eu/poem/api/0.1/json/
* Modify parameter ATP_ROOT_URL in file ncg/ncg.conf:
ATP_ROOT_URL=http://mon.egi.eu/atp
Parameter is repeated several times, you need to modify it on all places.
</ol>
</ul>


== 2.1 EMI-2 decommissioning  ==
== 2.1 EMI-2 decommissioning  ==
Line 131: Line 159:
==== 3.1 Next meeting  ====
==== 3.1 Next meeting  ====


'''... May 2014''' (we may skip one because of EGI CF)
'''June, 2nd 2014''' (we may skip one because of EGI CF)


= 4. Minutes  =
= 4. Minutes  =

Latest revision as of 14:04, 5 May 2014

Audio conference link: the conference system is Adobe Connect, no password required.
Audio conference details: Indico page



1. Middleware releases and staged rollout

1.1 News from URT

Recent, or future planned, releases from the product teams:

  • ARC - 13.11u1 version 4.1.0 planned to be released early April
    • Adding support for Rucio
  • dCache - UMD-3 dcache-server 2.6.23
    • Fixed WebDAV and HTTP support for libneon clients
  • BDII core - new glue-validator
  • DPM/LFC - v. 1.8.8
    • Several stability fixes
    • New DB connection pooling in dmlite-adapter and mysql
    • Workarounds if sysadmin accidentally called some mountpoints /dpm
    • DMLITE, DAV and LCGDM support explicit file placement request (pool/fs)
    • Improved the Glue reporting in dpm-listspaces
  • GFAL/lcg_utils - v. 2.5.5
    • Backported fix for segfault on the srm plugin
  • FTS3 - v. 3.1.74
    • Optimisations and bugfixes based on production testing in WLCG
  • GridSite - v. 2.2.3
    • Fixes improperly initialized time constants in the signature verification code
  • CANL - v. 2.1.4
    • fixes long-discussed certificate chain validation errors
  • WMS v. 3.6.4
    • Fix for job submission failing when the job ID starts with a dash

1.2 UMD release

  • No news

1.3 Staged rollout updates

New in SR:

  • wms v. 3.6.4
  • gridway v. 5.14.2
  • globus-info-provider-service v. 0.2.1

old stuff:

  • globus-default-security v. 5.2.4 (9 months)
  • security-integration v. 3.0.0 (7 months)
  • emi-cluster v. 2.0.1 (7 months)
  • globus-rls v. 5.2.5 (3 months)
  • mpi v. 1.5.3 (4 weeks)

UMD 3 Campaign

  • Sites are being contacted (full site list) to update their product lists for the UMD-3 products. Some sites have outdated contact points for their Early Adopters (EA), so please check in the table that all contacts are still correct and send me an email if you need to add or remove contacts (SSO account mandatory).
  • Sites that have not replied and are still tagged as EA for UMD1 / UMD2 products (updated Monday morning):
    • NGI_BG, BG01-IPP
    • NGI_DE, FZK-LCG2
    • NGI-DK, UNICPH-NBI
    • NGI_FRANCE, GRIF
    • NGI_FRANCE, IN2P3-CC
    • NGI_HR, egee.srce.hr
    • NGI_IBERGRID, UPV-GRyCAP
    • NGI_NL, SARA
    • NGI_SE, HPC2N
    • NGI_UA, UA-KNU
    • NGI_UK, UKI-LT2-RHUL
    • NGI_UK, UKI-NORTHGRID-LANCS-HEP
    • NGI_UK, UKI-NORTHGRID-MAN-HEP
    • NGI_UK, UKI-SCOTGRID-ECDF
    • NGI_UK, UKI-SOUTHGRID-CAM-HEP
    • NGI_UK, UKI-SOUTHGRID-OX-HEP
    • ROC_canada, CA-McGill-CLUMEQ-T2

1.4 Next releases

  • Mid-May
  • End of June
  • October

2. Operational issues

2.1 Report from DMSU

  • from Alessandro Paolini:
    • The Nagios probe eu.egi.sec.DPM-GLUE2-EMI-1 should be modified because it tries to detect information that the new version of DPM no longer publishes

2.2 Migration of Central SAM services & reconfiguration of NGIs SAM instances

  • Central SAM services are in the process of migrating from CERN to the new consortium (GRNET, CNRS and SRCE). To enable a smooth transition, we have agreed to start using new hostnames:
    • mon.egi.eu for grid-monitoring.cern.ch
    • opsmon.egi.eu for ops-monitor.cern.ch
  • CERN services will be operational until May 1st. Afterwards aliases will point to new instances.
  • If regional and VO SAM instances are not reconfigured, *they will stop working* after the CERN instance is switched off.
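
To see whether an instance still depends on the CERN hostnames, one option is to search its configuration for them. The following is only a minimal sketch: the helper name list_old_sam_refs is hypothetical, and which directories to search depends on the local installation.

```shell
#!/bin/sh
# list_old_sam_refs PATH...
# Prints the names of files under the given paths that still mention one
# of the old CERN hostnames. The exit status follows grep: 0 when at least
# one file matches, 1 when none do.
# (Helper name is illustrative; it is not part of SAM or Nagios.)
list_old_sam_refs() {
    grep -rlE 'grid-monitoring\.cern\.ch|ops-monitor\.cern\.ch' "$@"
}

# Example call, using a directory that appears in the configuration advice:
#   list_old_sam_refs /etc/poem
```

An empty result only means that the searched paths contain no literal reference to the old hostnames; it does not by itself confirm that the instance has been fully reconfigured.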


  • Configuration advice:
    1. NGI and VO SAM Nagios instances
      • Create file /etc/voms2htpasswd-static.d/opsmon.conf with the following content:
      /C=HR/O=edu/OU=srce/CN=opsmon.egi.eu
    2. NGI SAM Nagios instances
      • Set the following two variables in YAIM:
      ATP_ROOT_URL="http://mon.egi.eu/atp"
      POEM_SYNC_URLS="http://mon.egi.eu/poem/api/0.1/json/"
    3. Rerun YAIM:
      /opt/glite/yaim/bin/yaim -c -s site-info.def -n NAGIOS -n SAM_NAGIOS
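
The YAIM-based steps above can be rehearsed with a short script. This is a sketch under stated assumptions, not an official procedure: PREFIX is a hypothetical staging root (so nothing on the live system is touched), and placing the two variables in site-info.def is an assumption based on the file passed to the yaim command.

```shell
#!/bin/sh
# Sketch only. PREFIX is a hypothetical staging root used for a dry run;
# on a real host, set PREFIX="" and point SITE_INFO at your actual file.
PREFIX="${PREFIX-/tmp/sam-reconfig-demo}"
SITE_INFO="${SITE_INFO:-$PREFIX/site-info.def}"   # assumed location

# Step 1 (NGI and VO instances): register the DN of the new central host.
mkdir -p "$PREFIX/etc/voms2htpasswd-static.d"
printf '%s\n' '/C=HR/O=edu/OU=srce/CN=opsmon.egi.eu' \
    > "$PREFIX/etc/voms2htpasswd-static.d/opsmon.conf"

# Step 2 (NGI instances only): point ATP and POEM at the new central host.
# Appending assumes the file is read top to bottom, so that a later
# assignment overrides an earlier one; editing existing lines also works.
cat >> "$SITE_INFO" <<'EOF'
ATP_ROOT_URL="http://mon.egi.eu/atp"
POEM_SYNC_URLS="http://mon.egi.eu/poem/api/0.1/json/"
EOF

# Step 3: YAIM itself is not run by this sketch; print the command instead.
echo "Next: /opt/glite/yaim/bin/yaim -c -s site-info.def -n NAGIOS -n SAM_NAGIOS"
```

Running the script twice appends the two variables twice, which is harmless under the assumption stated in the comment but worth tidying up in a real site-info.def.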

    If you prefer not to run YAIM, skip steps 2 and 3 and perform the following:

    1. NGI and VO SAM Nagios instances
      • Restart service voms-htpasswd:
      service voms-htpasswd restart
    2. NGI SAM Nagios instances
      • Modify parameter POEM_SYNC_NS_URLS in file /etc/poem/poem_sync.ini:
      POEM_SYNC_NS_URLS: http://mon.egi.eu/poem/api/0.1/json/
      • Modify parameter ATP_ROOT_URL in file ncg/ncg.conf:
      ATP_ROOT_URL=http://mon.egi.eu/atp
      The parameter is repeated several times; you need to modify it in all places.
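
The manual edits can be scripted with sed so that every occurrence of ATP_ROOT_URL is updated in one pass. The sketch below makes several assumptions: the helper names update_poem and update_ncg are hypothetical, each parameter is assumed to sit on its own line (optionally indented, in the ncg.conf case), and the full path of ncg/ncg.conf must be supplied by the administrator because the agenda gives only a relative path.

```shell
#!/bin/sh
# update_poem FILE
# Rewrites the POEM_SYNC_NS_URLS line to the new central POEM endpoint.
# A copy of the original file is kept as FILE.bak.
update_poem() {
    sed -i.bak \
        's|^POEM_SYNC_NS_URLS:.*|POEM_SYNC_NS_URLS: http://mon.egi.eu/poem/api/0.1/json/|' \
        "$1"
}

# update_ncg FILE
# Rewrites every ATP_ROOT_URL assignment in FILE, keeping any leading
# indentation, since the parameter is repeated several times.
# A copy of the original file is kept as FILE.bak.
update_ncg() {
    sed -i.bak \
        's|^\([[:space:]]*\)ATP_ROOT_URL[[:space:]]*=.*|\1ATP_ROOT_URL=http://mon.egi.eu/atp|' \
        "$1"
}

# Example calls (the ncg.conf path is a placeholder to fill in locally):
#   service voms-htpasswd restart
#   update_poem /etc/poem/poem_sync.ini
#   update_ncg  "$NCG_CONF"
```

Comparing each file with its .bak copy (for example with diff) before restarting any service is a simple way to confirm that only the intended lines changed.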

2.1 EMI-2 decommissioning

2.3 Probes raising alarms since April

  • none

3. AOB

3.1 Next meeting

June 2nd, 2014 (we may skip one because of EGI CF)

4. Minutes

Minutes 07.04.2014