
Difference between revisions of "Agenda-18-07-2016"

 
(25 intermediate revisions by 2 users not shown)
Line 7: Line 7:


= UMD/CMD  =
= UMD/CMD  =
* '''UMD 3.14.3 to be released, fixing problem with gpgkeys, urgent'''
* UMD 3.14.2 released http://repository.egi.eu/2016/06/16/release-umd-3-14-2/
** StoRM 1.11.11, dCache 2.10.56, edg-mkgrid 4.0.3 (fix), Globus Network Manager on SL6
* UMD 4.1.1 released on 27-06-2016
** edg-mkgrid 4.0.3 (missing), Globus Network Manager on SL6
* '''UMD 4.2.0 to be released by July'''
** including on CentOS7: FTS3, QCG, dCache, ARGUS
** include on CentOS7 if possible: DPM, Globus/GFAL (fix issue with udt driver)
** add regular updates on SL6


== Preview repository ==
== Preview repository ==
Line 35: Line 47:


Results coming from NGI SAM instances are no longer consumed by the central ARGO or Operations Portal so NGIs can eventually decommission them following the standard decommissioning procedures (https://wiki.egi.eu/wiki/PROC12 ).
Results coming from NGI SAM instances are no longer consumed by the central ARGO or Operations Portal so NGIs can eventually decommission them following the standard decommissioning procedures (https://wiki.egi.eu/wiki/PROC12 ).
The FedCloud sites will be monitored by the new system starting from Aug 1st.


== New set of CREAM probes ==
== New set of CREAM probes ==
Line 143: Line 157:
| '''Note'''
| '''Note'''
|-
|-
| MK-03-FINKI
| <strike>MK-03-FINKI</strike>
| bdii.hpgcc.finki.ukim.mk, ce.hpgcc.finki.ukim.mk, se.hpgcc.finki.ukim.mk
| bdii.hpgcc.finki.ukim.mk, ce.hpgcc.finki.ukim.mk, se.hpgcc.finki.ukim.mk
| Top-BDII, Site-BDII/CREAM-CE, MyProxy/SRM
| Top-BDII, Site-BDII/CREAM-CE, MyProxy/SRM
| No
| No
| https://ggus.eu/?mode=ticket_info&ticket_id=122885
| https://ggus.eu/?mode=ticket_info&ticket_id=122885
|
| upgrade by Jul 22nd. SOLVED
|-
|-
| INDIACMS-TIFR
| <strike>INDIACMS-TIFR</strike>
| apel.indiacms.res.in, argus.indiacms.res.in
| apel.indiacms.res.in, argus.indiacms.res.in
| Site-BDII, ARGUS
| Site-BDII, ARGUS
Line 157: Line 171:
| downtime for upgrade for these services for Saturday and Sunday ( 16th and 17th); UPGRADED
| downtime for upgrade for these services for Saturday and Sunday ( 16th and 17th); UPGRADED
|-
|-
| UA-MHI
| <strike>UA-MHI</strike>
| arc.hpc-mhi.org
| arc.hpc-mhi.org
| Site-BDII/ARC-CE
| Site-BDII/ARC-CE
| No
| yes
| https://ggus.eu/?mode=ticket_info&ticket_id=122887
| https://ggus.eu/?mode=ticket_info&ticket_id=122887
|
| upgrade in a couple of weeks. Jul 28th: UPGRADED
|-
|-
| BMRZ-FRANKFURT
| <strike>BMRZ-FRANKFURT</strike>
| ce-enmr.chemie.uni-frankfurt.de, mb-enmr.chemie.uni-frankfurt.de
| ce-enmr.chemie.uni-frankfurt.de, mb-enmr.chemie.uni-frankfurt.de
| Site-BDII/CREAM-CE, WMS/LB
| Site-BDII/CREAM-CE, WMS/LB
| No
| No
| https://ggus.eu/?mode=ticket_info&ticket_id=122891
| https://ggus.eu/?mode=ticket_info&ticket_id=122891
|
| the site will be suspended and decommissioned. Jul 21st: SUSPENDED. SOLVED
|-
|-
| RTUEF
| <strike>RTUEF</strike>
| ce01.grid.etf.rtu.lv
| ce01.grid.etf.rtu.lv
| Site-BDII
| Site-BDII
| No
| No
| https://ggus.eu/?mode=ticket_info&ticket_id=122893
| https://ggus.eu/?mode=ticket_info&ticket_id=122893
|
| Site suspended by NGI
|-
|-
| SCAI
| SCAI
Line 185: Line 199:
| some services will be decommissioned, some services will be upgraded
| some services will be decommissioned, some services will be upgraded
|-
|-
| MY-UPM-BIRUNI-01
| <strike>MY-UPM-BIRUNI-01</strike>
| is.biruni.upm.my, haitham.biruni.upm.my and razi.biruni.upm.my, px.biruni.upm.my, lb.biruni.upm.my
| is.biruni.upm.my, haitham.biruni.upm.my and razi.biruni.upm.my, px.biruni.upm.my, lb.biruni.upm.my
| Site-BDII, CREAM-CE, MyProxy, LB
| Site-BDII, CREAM-CE, MyProxy, LB
| No
| No
| https://ggus.eu/?mode=ticket_info&ticket_id=122895
| https://ggus.eu/?mode=ticket_info&ticket_id=122895
|
| the upgrade will last a couple of weeks. Sep 22nd: site suspended for running SL5 software.
|-
|-
| BG05-SUGrid
| <strike>BG05-SUGrid</strike>
| sbdii.grid.uni-sofia.bg
| sbdii.grid.uni-sofia.bg
| Site-BDII
| Site-BDII
Line 199: Line 213:
| upgrade scheduled; UPGRADED
| upgrade scheduled; UPGRADED
|-
|-
| UA_ICYB_ARC
| <strike>UA_ICYB_ARC</strike>
| uagrid.org.ua
| uagrid.org.ua
| Site-BDII/ARC-CE
| Site-BDII/ARC-CE
| yes
| yes
| https://ggus.eu/?mode=ticket_info&ticket_id=122897
| https://ggus.eu/?mode=ticket_info&ticket_id=122897
| upgrade scheduled for next week
| upgrade scheduled for next week. Jul 22nd: UPGRADED
|-
|-
| KR-UOS-SSCC
| <strike>KR-UOS-SSCC</strike>
| uosaf0006.sscc.uos.ac.kr
| uosaf0006.sscc.uos.ac.kr
| Site-BDII/CREAM
| Site-BDII/CREAM
Line 213: Line 227:
| will update the machine
| will update the machine
|-
|-
| UA_ICMP_ARC
| <strike>UA_ICMP_ARC</strike>
| west.icmp.lviv.ua  
| west.icmp.lviv.ua  
| Site-BDII/ARC-CE
| Site-BDII/ARC-CE
| yes
| yes
| https://ggus.eu/?mode=ticket_info&ticket_id=122899
| https://ggus.eu/?mode=ticket_info&ticket_id=122899
| planning to start upgrade next week and the corresponding downtime is scheduled starting from Sunday evening (Jul 17th 20:00) for 2 weeks
| planning to start upgrade next week and the corresponding downtime is scheduled starting from Sunday evening (Jul 17th 20:00) for 2 weeks. Aug 8th: DONE
|-
|-
| IR-IPM-HEP
| <strike>IR-IPM-HEP</strike>
| ce2.particles.ipm.ac.ir
| ce2.particles.ipm.ac.ir
| CREAM-CE
| CREAM-CE
| yes
| yes
| https://ggus.eu/?mode=ticket_info&ticket_id=122900
| https://ggus.eu/?mode=ticket_info&ticket_id=122900
| upgrading...
| upgrading... Jul 20th: DONE
|-
|-
| JP-KEK-CRC-02  
| <strike>JP-KEK-CRC-02</strike>
| kek2-ce01.cc.kek.jp, kek2-px.cc.kek.jp, kek2-wms.cc.kek.jp, kek2-lb.cc.kek.jp
| kek2-ce01.cc.kek.jp, kek2-px.cc.kek.jp, kek2-wms.cc.kek.jp, kek2-lb.cc.kek.jp
| CREAM-CE, MyProxy, WMS, LB
| CREAM-CE, MyProxy, WMS, LB
| yes
| yes
| https://ggus.eu/?mode=ticket_info&ticket_id=122901
| https://ggus.eu/?mode=ticket_info&ticket_id=122901
| will be out of service on Friday 5th August 2016 at 4:00 UTC
| will be out of service on Friday 5th August 2016 at 4:00 UTC. SOLVED
|-
|-
| CBPF
| <strike>CBPF</strike>
| myproxy.cat.cbpf.br
| myproxy.cat.cbpf.br
| MyProxy
| MyProxy
| yes
| yes
| https://ggus.eu/?mode=ticket_info&ticket_id=122902
| https://ggus.eu/?mode=ticket_info&ticket_id=122902
| will be decommissioned together with nagios, downtime created
| will be decommissioned together with nagios, downtime created https://goc.egi.eu/portal/index.php?Page_Type=Downtime&id=21306 . SOLVED
|-
|-
| WEIZMANN-LCG2
| <strike>WEIZMANN-LCG2</strike>
| wipp-rb.weizmann.ac.il
| wipp-rb.weizmann.ac.il
| WMS/LB/MyProxy
| WMS/LB/MyProxy
| No
| No
| https://ggus.eu/?mode=ticket_info&ticket_id=122905
| https://ggus.eu/?mode=ticket_info&ticket_id=122905
|
| site-admin on vacation; the WMS service is used only by their nagios server, they try to migrate to sl6 but found some issues in the interaction with ARGUS. SOLVED
|-
|-
| NIKHEF-ELPROD
| NIKHEF-ELPROD
Line 264: Line 278:
== Next meeting ==
== Next meeting ==


* '''08th Aug 2016''' https://indico.egi.eu/indico/event/3004/
* '''08th Aug 2016''' https://indico.egi.eu/indico/event/3004/ -> cancelled in favour of 12 September https://indico.egi.eu/indico/event/3005/
* new calendar available until end of 2016 https://indico.egi.eu/indico/category/32/
* new calendar available until end of 2016 https://indico.egi.eu/indico/category/32/

Latest revision as of 15:16, 26 September 2016


General information

UMD/CMD

  • UMD 4.1.1 released on 27-06-2016
    • edg-mkgrid 4.0.3 (missing), Globus Network Manager on SL6
  • UMD 4.2.0 to be released by July
    • including on CentOS7: FTS3, QCG, dCache, ARGUS
    • include on CentOS7 if possible: DPM, Globus/GFAL (fix issue with udt driver)
    • add regular updates on SL6

Preview repository

Generic information about Preview repository: https://wiki.egi.eu/wiki/Preview_Repository

Note: EGI provides the preview repository without any additional quality assurance process: the products are released as they are provided by the product teams. EGI recommends the use of the UMD repositories, which contain software verified through the quality assurance process of UMD.

Operations

EGI central monitoring instance (ARGO)

Since July 1st, the EGI infrastructure has been monitored by two monitoring instances, available at these addresses:

https://argo-mon.egi.eu/nagios
https://argo-mon2.egi.eu/nagios

Both instances run the same set of tests, and the results they provide are equivalent.

Starting from the same date, the central ARGO Web UI (http://argo.egi.eu/lavoisier) provides information from these two instances, and the Operations Portal has been reconfigured to raise alarms based on information from the ARGO central instances.

Results coming from NGI SAM instances are no longer consumed by the central ARGO or Operations Portal so NGIs can eventually decommission them following the standard decommissioning procedures (https://wiki.egi.eu/wiki/PROC12 ).

The FedCloud sites will be monitored by the new system starting from Aug 1st.

New set of CREAM probes

A new set of probes is being used for monitoring the CREAM CEs and for the A/R computation: https://wiki.italiangrid.it/twiki/bin/view/CREAM/DjsCreamProbeNew Unlike the old WN monitoring framework, this set of probes does not make use of the BDII, the WMS, or the messaging infrastructure.

RFC proxy will be default

  • moving to RFC proxy instead of legacy proxy
  • in production for a while; everybody is using RFC
  • we will ask VOMS TP to make a small modification to the VOMS client, changing the default

New configuration for DTEAM VO

The HellasGrid Certification Authority changed its DN from "/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2006" to "/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2016"

Since the certificates of the 2 VOMS servers hosting the dteam VO have also changed, the settings of this VO need to be updated accordingly on *ALL THE (grid and cloud) SERVICES*

- New yaim settings (for the ../vo.d/dteam file):

VOMS_SERVERS='vomss://voms.hellasgrid.gr:8443/voms/dteam?/dteam/'
VOMSES="'dteam voms.hellasgrid.gr 15004 /C=GR/O=HellasGrid/OU=hellasgrid.gr/CN=voms.hellasgrid.gr dteam 24' 'dteam voms2.hellasgrid.gr 15004 /C=GR/O=HellasGrid/OU=hellasgrid.gr/CN=voms2.hellasgrid.gr dteam 24'"
VOMS_CA_DN="'/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2016' '/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2016'"

- .lsc files:

# cat /etc/grid-security/vomsdir/dteam/voms.hellasgrid.gr.lsc
/C=GR/O=HellasGrid/OU=hellasgrid.gr/CN=voms.hellasgrid.gr
/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2016
# cat /etc/grid-security/vomsdir/dteam/voms2.hellasgrid.gr.lsc
/C=GR/O=HellasGrid/OU=hellasgrid.gr/CN=voms2.hellasgrid.gr
/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2016

- configuration information:

https://voms.hellasgrid.gr:8443/voms/dteam/configuration/configuration.action
https://voms2.hellasgrid.gr:8443/voms/dteam/configuration/configuration.action
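
The .lsc contents listed above follow directly from the yaim settings: each file holds the VOMS server's host DN on the first line and the issuing CA DN on the second. The following Python sketch derives them from the VOMSES and VOMS_CA_DN values. It is an illustration only, not part of yaim; the helper name `lsc_files` is hypothetical, and it assumes each VOMSES entry has the layout seen above (three leading fields, then the DN, then two trailing fields).

```python
import shlex

# Values copied from the yaim settings for the dteam VO.
VOMSES = (
    "'dteam voms.hellasgrid.gr 15004 "
    "/C=GR/O=HellasGrid/OU=hellasgrid.gr/CN=voms.hellasgrid.gr dteam 24' "
    "'dteam voms2.hellasgrid.gr 15004 "
    "/C=GR/O=HellasGrid/OU=hellasgrid.gr/CN=voms2.hellasgrid.gr dteam 24'"
)
VOMS_CA_DN = (
    "'/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2016' "
    "'/C=GR/O=HellasGrid/OU=Certification Authorities/CN=HellasGrid CA 2016'"
)


def lsc_files(vomses, ca_dns):
    """Return {filename: content} for the .lsc files (hypothetical helper)."""
    entries = shlex.split(vomses)   # one quoted entry per VOMS server
    cas = shlex.split(ca_dns)       # one quoted CA DN per VOMS server
    if len(entries) != len(cas):
        raise ValueError("VOMSES and VOMS_CA_DN must have the same number of entries")
    files = {}
    for entry, ca_dn in zip(entries, cas):
        fields = entry.split()
        host = fields[1]
        # Assumed layout: the DN is everything between the first three
        # fields and the last two, so a DN containing spaces stays intact.
        host_dn = " ".join(fields[3:-2])
        files[f"{host}.lsc"] = f"{host_dn}\n{ca_dn}\n"
    return files


if __name__ == "__main__":
    for name, content in lsc_files(VOMSES, VOMS_CA_DN).items():
        print(f"# /etc/grid-security/vomsdir/dteam/{name}")
        print(content, end="")
```

Comparing this output with the files actually present under /etc/grid-security/vomsdir/dteam/ is a quick way to spot a service that still references the 2006 CA.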


Monthly Availability/Reliability

A/R report on ARGO: http://argo.egi.eu/lavoisier/ngi_reports?accept=html

List of the underperforming RCs for (at least) 3 consecutive months:

  • NGI_MARGI https://ggus.eu/index.php?mode=ticket_info&ticket_id=118465 no monitoring data since January

Decommissioning SL5

Status and actions (Jul 14th)

  • 1 Top-BDII bdii.hpgcc.finki.ukim.mk
  • 11 Site-BDII apel.indiacms.res.in arc.hpc-mhi.org ce-enmr.chemie.uni-frankfurt.de ce.hpgcc.finki.ukim.mk ce01.grid.etf.rtu.lv glite-bdii.scai.fraunhofer.de is.biruni.upm.my sbdii.grid.uni-sofia.bg uagrid.org.ua uosaf0006.sscc.uos.ac.kr west.icmp.lviv.ua
  • 4 MyProxy kek2-px.cc.kek.jp myproxy.cat.cbpf.br wipp-rb.weizmann.ac.il
  • 7 WMS and 6 LB glite-wms.scai.fraunhofer.de graspol.nikhef.nl graszode.nikhef.nl graskant.nikhef.nl grasveld.nikhef.nl lb.biruni.upm.my kek2-wms.cc.kek.jp kek2-lb.cc.kek.jp mb-enmr.chemie.uni-frankfurt.de marwmsmr.in2p3.fr (downtime)
  • 1 VOMS glite-io.scai.fraunhofer.de
  • 1 ARGUS argus.indiacms.res.in
  • 7 CREAM-CE ce-enmr.chemie.uni-frankfurt.de ce.hpgcc.finki.ukim.mk ce2.particles.ipm.ac.ir glite-cream.scai.fraunhofer.de haitham.biruni.upm.my kek2-ce01.cc.kek.jp razi.biruni.upm.my
  • 0 QCG Computing
  • 1 STORM grid-se2.pr.infn.it (downtime)
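
Each status line above gives a count followed by the hostnames it refers to, so the two can be cross-checked mechanically. The Python sketch below is a minimal illustration of that check on three of the lines; the function names are hypothetical, tokens containing a dot are assumed to be hostnames, and combined lines such as "7 WMS and 6 LB" are out of scope.

```python
# Three of the status lines, copied without the bullet character.
STATUS_LINES = [
    "1 Top-BDII bdii.hpgcc.finki.ukim.mk",
    "4 MyProxy kek2-px.cc.kek.jp myproxy.cat.cbpf.br wipp-rb.weizmann.ac.il",
    "1 STORM grid-se2.pr.infn.it (downtime)",
]


def check_line(line):
    """Return (service label, declared count, number of hostnames listed)."""
    count, *rest = line.split()
    # Assumption: any token containing a dot is a hostname.
    hosts = [t for t in rest if "." in t]
    # The label is whatever remains, ignoring remarks such as "(downtime)".
    label = " ".join(t for t in rest if "." not in t and not t.startswith("("))
    return label, int(count), len(hosts)


def mismatches(lines):
    """List the lines whose declared count differs from the hosts listed."""
    return [
        (label, declared, listed)
        for label, declared, listed in map(check_line, lines)
        if declared != listed
    ]


print(mismatches(STATUS_LINES))  # → [('MyProxy', 4, 3)]
```

The MyProxy line declares 4 services but names three hosts, which may simply mean one hostname was left out of the notes.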

RCs about to be suspended

Site | Hostname | Service | Downtime | Ticket | Note
MK-03-FINKI | bdii.hpgcc.finki.ukim.mk, ce.hpgcc.finki.ukim.mk, se.hpgcc.finki.ukim.mk | Top-BDII, Site-BDII/CREAM-CE, MyProxy/SRM | No | https://ggus.eu/?mode=ticket_info&ticket_id=122885 | upgrade by Jul 22nd. SOLVED
INDIACMS-TIFR | apel.indiacms.res.in, argus.indiacms.res.in | Site-BDII, ARGUS | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122886 | downtime for upgrade for these services for Saturday and Sunday (16th and 17th); UPGRADED
UA-MHI | arc.hpc-mhi.org | Site-BDII/ARC-CE | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122887 | upgrade in a couple of weeks. Jul 28th: UPGRADED
BMRZ-FRANKFURT | ce-enmr.chemie.uni-frankfurt.de, mb-enmr.chemie.uni-frankfurt.de | Site-BDII/CREAM-CE, WMS/LB | No | https://ggus.eu/?mode=ticket_info&ticket_id=122891 | the site will be suspended and decommissioned. Jul 21st: SUSPENDED. SOLVED
RTUEF | ce01.grid.etf.rtu.lv | Site-BDII | No | https://ggus.eu/?mode=ticket_info&ticket_id=122893 | Site suspended by NGI
SCAI | glite-bdii.scai.fraunhofer.de, glite-io.scai.fraunhofer.de, glite-cream.scai.fraunhofer.de, glite-wms.scai.fraunhofer.de | Site-BDII, VOMS, CREAM-CE, WMS/LB | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122894 | some services will be decommissioned, some services will be upgraded
MY-UPM-BIRUNI-01 | is.biruni.upm.my, haitham.biruni.upm.my and razi.biruni.upm.my, px.biruni.upm.my, lb.biruni.upm.my | Site-BDII, CREAM-CE, MyProxy, LB | No | https://ggus.eu/?mode=ticket_info&ticket_id=122895 | the upgrade will last a couple of weeks. Sep 22nd: site suspended for running SL5 software.
BG05-SUGrid | sbdii.grid.uni-sofia.bg | Site-BDII | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122896 | upgrade scheduled; UPGRADED
UA_ICYB_ARC | uagrid.org.ua | Site-BDII/ARC-CE | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122897 | upgrade scheduled for next week. Jul 22nd: UPGRADED
KR-UOS-SSCC | uosaf0006.sscc.uos.ac.kr | Site-BDII/CREAM | No | https://ggus.eu/?mode=ticket_info&ticket_id=122898 | will update the machine
UA_ICMP_ARC | west.icmp.lviv.ua | Site-BDII/ARC-CE | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122899 | planning to start upgrade next week and the corresponding downtime is scheduled starting from Sunday evening (Jul 17th 20:00) for 2 weeks. Aug 8th: DONE
IR-IPM-HEP | ce2.particles.ipm.ac.ir | CREAM-CE | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122900 | upgrading... Jul 20th: DONE
JP-KEK-CRC-02 | kek2-ce01.cc.kek.jp, kek2-px.cc.kek.jp, kek2-wms.cc.kek.jp, kek2-lb.cc.kek.jp | CREAM-CE, MyProxy, WMS, LB | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122901 | will be out of service on Friday 5th August 2016 at 4:00 UTC. SOLVED
CBPF | myproxy.cat.cbpf.br | MyProxy | yes | https://ggus.eu/?mode=ticket_info&ticket_id=122902 | will be decommissioned together with nagios, downtime created https://goc.egi.eu/portal/index.php?Page_Type=Downtime&id=21306 . SOLVED
WEIZMANN-LCG2 | wipp-rb.weizmann.ac.il | WMS/LB/MyProxy | No | https://ggus.eu/?mode=ticket_info&ticket_id=122905 | site-admin on vacation; the WMS service is used only by their nagios server; they tried to migrate to SL6 but found some issues in the interaction with ARGUS. SOLVED
NIKHEF-ELPROD | graspol.nikhef.nl, graszode.nikhef.nl, graskant.nikhef.nl, grasveld.nikhef.nl | WMS, LB | No | https://ggus.eu/?mode=ticket_info&ticket_id=122903 | they want to keep the services, ticket UNSOLVED

FedCloud status

AOB

Next meeting