Agenda-15-05-2017

From EGIWiki
Jump to: navigation, search
Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Documentation menu: Home Manuals Procedures Training Other Contact For: VO managers Administrators


Contents

General information

Middleware

CMD

UMD

Preview repository

released on:

Operations

Testing FedCloud sites

Resource Centre
STATUS
IFCA-LCG2 OK
IN2P3-IRES OK
100IT OK
RECAS-BARI OK
CESNET-MetaCloud OK
FZJ OK
BEgrid-BELNET OK
INFN-CATANIA-STACK OK
TR-FC1-ULAKBIM OK
INFN-PADOVA-STACK OK
IISAS-FedCloud OK
UPV-GRyCAP OK
IISAS-Nebula

OK (but not supporting fedcloud VO)

CLOUDIFIN <style type="text/css"><!--td {border: 1px solid #ccc;}br {mso-data-placement:same-cell;}--></style>https://ggus.eu/index.php?mode=ticket_info&ticket_id=128104
SCAI <style type="text/css"><!--td {border: 1px solid #ccc;}br {mso-data-placement:same-cell;}--></style>https://ggus.eu/?mode=ticket_info&ticket_id=127821
CYFRONET-CLOUD <style type="text/css"><!--td {border: 1px solid #ccc;}br {mso-data-placement:same-cell;}--></style>https://ggus.eu/index.php?mode=ticket_info&ticket_id=128100
CESGA <style type="text/css"><!--td {border: 1px solid #ccc;}br {mso-data-placement:same-cell;}--></style>https://ggus.eu/?mode=ticket_info&ticket_id=127815
BIFI <style type="text/css"><!--td {border: 1px solid #ccc;}br {mso-data-placement:same-cell;}--></style>https://ggus.eu/index.php?mode=ticket_info&ticket_id=128096
HG-09-Okeanos-Cloud replacing VMCaster with Cloudkeeper
CETA-GRID https://ggus.eu/?mode=ticket_info&ticket_id=124224
GoeGrid <style type="text/css"><!--td {border: 1px solid #ccc;}br {mso-data-placement:same-cell;}--></style>https://ggus.eu/?mode=ticket_info&ticket_id=128101
NCG-INGRID-PT keystone v3 with OpenID Connect (experimental)


Feedback from Helpdesk

yearly review of the information registered into GOC-DB

2017-04-07

On a yearly basis, the information registered into GOC-DB need to be verified. NGIs and RCs have been asked to check them. In particular:

  1. NGI managers should review the people registered and the roles assigned to them, and in particular check the following information:
    • E-Mail
    • ROD E-Mail
    • Security E-Mail
NGI Managers should also review the status of the "not certified" RCs, in according to the RC Status Workflow;
  1. RCs administrators should review the people registered and the roles assigned to them, and in particular check the following information:
    • E-Mail
    • telephone numbers
    • CSIRT E-Mail
RC administrators should also review the information related to the registered service endpoints.

The process should be completed by Apr 28th.

To track the process, a series of tickets have been opened.

2017-05-15 UPDATE:

Failures with the updated CREAM probes

After the release of the updated CREAM probes on May 4th, several sites are failing the JobCancel and/or JobPurge ones (GGUS 128151):

The main reason is that in those CEs there isn't a job slot reserved for the ops tests.

As explained in the CREAM probes wiki:

They both have a timeout of 15 minutes, so if the test job is not executed by that time, the probes return a failure. Please assign the ops jobs an higher priority and reserve them 1 job slot, they only require few seconds for being executed.

These failures didn't occur before May 4th because in the first version of the probes the returned status was "UNKNOWN" instead of the most proper one "CRITICAL".

List of failing CREAM-CEs from nagios (not all of them are affected by this problem):

Monthly Availability/Reliability

Proposal to modify the declaration of scheduled interventions

Currently (see MAN02 Service intervention management) scheduled interventions (of any duration) MUST be declared at least 24 hours in advance, specifying reason and duration; any intervention declared less than 24 hours in advance will be considered unscheduled.

WLCG proposed the following modification:

2017-05-15 UPDATE

At the last OMB, the WLCG proposal was rejected:

Then it was proposed to extend the advance notice time from 24 hours to 5 days, but neither in this case the NGIs was in favour of it.

New proposal: Would you be in favour of extending the advance notice time to 3 days for scheduled downtimes of any duration?

Decommissioning EMI WMS

As discussed at the February and April/May OMBs, we are making plans for decommissioning the WMS and moving to DIRAC.

NGIs provided WMS usage statistics, and in general the usage is relatively low, mainly for local testing

Moderate usage by few VOs:

EGI contacted these VOs to agree a smooth migration of their activities to DIRAC, only some of them replied till now:

We need the VO feedback for better defining technical details and timeline:

WMS servers can be decommissioned as soon as the supported VOs do not need them any more. The proposal is:

IPv6 readiness plans

Decommissioning of dCache 2.10 and 2.13 (to modify)



Testing the new webdav probes

Site Host GGUSID note
CYFRONET-LCG2 se01.grid.cyfronet.pl https://ggus.eu/index.php?mode=ticket_info&ticket_id=128325 SOLVED
GRIF node12.datagrid.cea.fr https://ggus.eu/index.php?mode=ticket_info&ticket_id=128329
IGI-BOLOGNA darkstorm.cnaf.infn.it https://ggus.eu/index.php?mode=ticket_info&ticket_id=127930 SOLVED
INFN-T1 storm-fe-lhcb.cr.cnaf.infn.it, storm-fe.cr.cnaf.infn.it, storm-fe-archive.cr.cnaf.infn.it https://ggus.eu/index.php?mode=ticket_info&ticket_id=128326
NCG-INGRID-PT gftp01.ncg.ingrid.pt https://ggus.eu/index.php?mode=ticket_info&ticket_id=128327 SOLVED
UKI-NORTHGRID-LIV-HEP hepgrid11.ph.liv.ac.uk https://ggus.eu/index.php?mode=ticket_info&ticket_id=128328 SOLVED
egee.irb.hr lorienmaster.irb.hr

Missing steps:

Testing of the storage accounting

As discussed during the January OMB, the APEL team would need one site per NGI for testing the storage accounting. The eligible sites are the ones providing either dCache or DPM storage elements.

More information can be found in the following wiki: https://wiki.egi.eu/wiki/APEL/Storage

List of sites available for test.

2017-05-15 UPDATE:

Currently the accounting service types are:
  1. glite-APEL: for authorizing the sending of the messages
  2. APEL: to monitor the accounting data publication


Monitoring of the UNCERTIFIED sites

Information about the proposal for using GOCDB as the only source of topology information for ARGO:


Nagios server for the uncertified sites: https://argo-mon-uncert.cro-ngi.hr/nagios/

PROC09 modified accordingly.

VAPOR

Next version will be focused on these computations to be able to provide better figures.

AOB

Next meeting

Personal tools
Namespaces
Variants
Actions
Navigation
Toolbox
Print/export