Agenda-05-05-2014
Latest revision as of 14:04, 5 May 2014
Audio conference link: the conference system is Adobe Connect; no password required.
Audio conference details: Indico page
1. Middleware releases and staged rollout
1.1 News from URT
Recent, or future planned, releases from the product teams:
- ARC - 13.11u1 version 4.1.0 planned to be released early April
- Adding support for Rucio
- dCache - UMD-3 dcache-server 2.6.23
- Fixed WebDAV and HTTP support for libneon clients
- BDII core - new glue-validator
- DPM/LFC - v. 1.8.8
- Several stability fixes
- New DB connection pooling in dmlite-adapter and mysql
- Workarounds if sysadmin accidentally called some mountpoints /dpm
- DMLITE, DAV and LCGDM support explicit file placement request (pool/fs)
- Improved the Glue reporting in dpm-listspaces
- GFAL/lcg_utils - v. 2.5.5
- Backported fix for segfault on the srm plugin
- FTS3 - v. 3.1.74
- Optimisations and bugfixes based on production testing in WLCG
- GridSite - v. 2.2.3
- Fixes improperly initialized time constants in the signature verification code
- CANL - v. 2.1.4
- Fixes long-discussed certificate chain validation errors
- WMS - v. 3.6.4
- Fix for job submission failing when the job ID starts with a dash
1.2 UMD release
- No news
1.3 Staged rollout updates
New in SR:
- wms v. 3.6.4
- gridway v. 5.14.2
- globus-info-provider-service v. 0.2.1
Old stuff:
- globus-default-security v. 5.2.4 (9 months)
- security-integration v. 3.0.0 (7 months)
- emi-cluster v. 2.0.1 (7 months)
- globus-rls v. 5.2.5 (3 months)
- mpi v. 1.5.3 (4 weeks)
UMD 3 Campaign
- Contacting sites (full site list) to update the products list for the UMD-3 products. Some sites have outdated Early Adopter (EA) contact points, so please check in the table that all contacts are still correct and send me an email if you need to add or remove contacts (SSO account mandatory).
- Sites that have not replied and are still tagged as EA for UMD1 / UMD2 products (updated Monday morning):
- NGI_BG, BG01-IPP
- NGI_DE, FZK-LCG2
- NGI-DK, UNICPH-NBI
- NGI_FRANCE, GRIF
- NGI_FRANCE, IN2P3-CC
- NGI_HR, egee.srce.hr
- NGI_IBERGRID, UPV-GRyCAP
- NGI_NL, SARA
- NGI_SE, HPC2N
- NGI_UA, UA-KNU
- NGI_UK, UKI-LT2-RHUL
- NGI_UK, UKI-NORTHGRID-LANCS-HEP
- NGI_UK, UKI-NORTHGRID-MAN-HEP
- NGI_UK, UKI-SCOTGRID-ECDF
- NGI_UK, UKI-SOUTHGRID-CAM-HEP
- NGI_UK, UKI-SOUTHGRID-OX-HEP
- ROC_canada, CA-McGill-CLUMEQ-T2
1.4 Next releases
- Mid-May
- End of June
- October
2. Operational issues
2.1 Report from DMSU
- from Alessandro Paolini:
- The Nagios probe eu.egi.sec.DPM-GLUE2-EMI-1 should be modified because it tries to detect information that the new version of DPM no longer publishes
- references: GGUS #104943, GGUS #105143
2.2 Migration of Central SAM services & reconfiguration of NGIs SAM instances
- Central SAM services are migrating from CERN to the new consortium (GRNET, CNRS and SRCE). To enable a smooth transition, we have agreed to start using new hostnames:
- mon.egi.eu for grid-monitoring.cern.ch
- opsmon.egi.eu for ops-monitor.cern.ch
- CERN services will be operational until May 1st. Afterwards, the aliases will point to the new instances.
- If regional and VO SAM instances are not reconfigured, they will stop working after the CERN instance is switched off.
- The following instances are not yet configured, and tickets have been opened to follow them up:
- cygrid-nagios.grid.ucy.ac.cy - NGI_CYGRID #105123
- grid-nagios.ii.edu.mk - NGI_MARGI #105124
- mon-ua.bitp.kiev.ua - NGI_UA #105125
- nagios.egee.cesnet.cz - NGI_CZ #105126
- nagios.ipp.acad.bg - NGI_BG #105127
- ngi-de-nagios.gridka.de - NGI_DE #105128
- node02-02.imi.renam.md - NGI_MD #105129
- rnag1.grid.kiae.ru - ROC_Russia #105130
- rocnagios.grid.sinica.edu.tw - ROC_Asia/Pacific #105131
- sam.grid.am - NGI_ARMGRID #105132
- wipp-srs.weizmann.ac.il - NGI_IL #105133
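- Configuration advice:
  1. NGI and VO SAM Nagios instances
    - Create file /etc/voms2htpasswd-static.d/opsmon.conf with the following content:
        /C=HR/O=edu/OU=srce/CN=opsmon.egi.eu
  2. NGI SAM Nagios instances
    - Set the following two variables in YAIM:
        ATP_ROOT_URL="http://mon.egi.eu/atp"
        POEM_SYNC_URLS="http://mon.egi.eu/poem/api/0.1/json/"
  3. Rerun YAIM:
        /opt/glite/yaim/bin/yaim -c -s site-info.def -n NAGIOS -n SAM_NAGIOS
- If you prefer not to run YAIM, skip steps 2 & 3 and perform the following:
  a. NGI and VO SAM Nagios instances
    - Restart service voms-htpasswd:
        service voms-htpasswd restart
  b. NGI SAM Nagios instances
    - Modify parameter POEM_SYNC_NS_URLS in file /etc/poem/poem_sync.ini:
        POEM_SYNC_NS_URLS: http://mon.egi.eu/poem/api/0.1/json/
    - Modify parameter ATP_ROOT_URL in file ncg/ncg.conf:
        ATP_ROOT_URL=http://mon.egi.eu/atp
      The parameter is repeated several times; you need to modify it in all places.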
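For sites taking the manual (non-YAIM) route, the parameter changes in /etc/poem/poem_sync.ini and ncg/ncg.conf could be scripted rather than edited by hand, which helps because ATP_ROOT_URL occurs several times in ncg/ncg.conf. The following is only a minimal sketch in Python: the helper name `set_param` and the sample file content are hypothetical, and it assumes each parameter sits on its own line in `NAME=value` or `NAME: value` form. It works on text in memory, so reading and writing the real files (and backing them up first) is left to the operator.

```python
import re

# New endpoint values taken from the configuration advice above.
NEW_ATP_ROOT_URL = "http://mon.egi.eu/atp"
NEW_POEM_SYNC_URL = "http://mon.egi.eu/poem/api/0.1/json/"


def set_param(text, name, value):
    """Replace the value on every `name=...` or `name: ...` line of text.

    Hypothetical helper, not part of SAM. Returns a tuple of the updated
    text and the number of lines that were changed.
    """
    pattern = re.compile(
        r"^([ \t]*" + re.escape(name) + r"[ \t]*[=:][ \t]*).*$",
        re.MULTILINE,
    )
    # A function is used as the replacement so that the new value is
    # inserted literally, keeping the original indentation and separator.
    return pattern.subn(lambda m: m.group(1) + value, text)


# Made-up sample content; a real ncg.conf will look different.
sample_ncg = (
    "ATP_ROOT_URL=http://grid-monitoring.cern.ch/atp\n"
    "OTHER_SETTING=1\n"
    "  ATP_ROOT_URL = http://grid-monitoring.cern.ch/atp\n"
)

updated, changed = set_param(sample_ncg, "ATP_ROOT_URL", NEW_ATP_ROOT_URL)
print(changed)
print(updated)
```

The returned count gives a quick check that every occurrence was found: if it is lower than the number of ATP_ROOT_URL lines expected in the file, some occurrences are probably written in a form the pattern does not cover and need to be edited manually.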
2.1 EMI-2 decommissioning
- Probes are running in midmon: Documentation.
- All products but dCache are being retired as previously announced. dCache extended the support for the 2.2.x versions until July 2014.
- List of services failing EMI-2 test:
- as of March 7th - Download XLS file
- as of March 3rd - EMI2_endpoints_NGI_07042014
- as of April 24 - EMI2_endpoints_NGI_24042014
- Status - presentation: PPTX, PPT, PDF
- Please report any errors ASAP
2.3 Probes raising alarms since April
- none
3. AOB
3.1 Next meeting
June 2nd, 2014 (we may skip one because of EGI CF)