Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Agenda-2021-07-12"

From EGIWiki
Jump to navigation Jump to search
Line 76: Line 76:
*** '''INFN-PISA''': HTCondorCE and SRM failures
*** '''INFN-PISA''': HTCondorCE and SRM failures
** NGI_UA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152258
** NGI_UA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152258
*** UA-BITP: authentication issues with one of the nagios servers, fixed; additionally, power supply issues at the resource center
*** '''UA-BITP''': authentication issues with one of the nagios servers, fixed; additionally, power supply issues at the resource center
*** UA-KNU: storage system degradation
*** '''UA-KNU''': storage system degradation
** ROC_LA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=148956
** ROC_LA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=148956
*** '''CBPF''': DPM updated; SRM failures due to information not properly published, fixed; other SRM failures due to available space
*** '''CBPF''': DPM updated; SRM failures due to information not properly published, fixed; other SRM failures due to available space
Line 84: Line 84:
*Under-performed sites after 3 consecutive months, under-performed NGIs, QoS violations: ('''June 2021'''):
*Under-performed sites after 3 consecutive months, under-performed NGIs, QoS violations: ('''June 2021'''):
** NGI_RO: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152839
** NGI_RO: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152839
*** CLOUDIFIN: problem with wsgi-keystone-oidc-voms and EGI Check-in ([https://ggus.eu/index.php?mode=ticket_info&ticket_id=151538 GGUS 151538]) between April and May; crashing of Nova service on some new compute nodes (new resources were added to the site) in the second half of June; statistics are now improving
*** '''CLOUDIFIN''': problem with wsgi-keystone-oidc-voms and EGI Check-in ([https://ggus.eu/index.php?mode=ticket_info&ticket_id=151538 GGUS 151538]) between April and May; crashing of Nova service on some new compute nodes (new resources were added to the site) in the second half of June; statistics are now improving; nn the entire period the users were not affected and all of the existing VMs were running without any problems.
On the entire period the users were not affected and all of the existing VMs were running without any problems.
** NGI_UA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152841
** NGI_UA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152841
*** UA-NSCMBR
*** '''UA-NSCMBR'''
** Russia: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152840
** Russia: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152840
*** RU-SARFTI
*** '''RU-SARFTI'''
*** RU-SPbSU
*** '''RU-SPbSU'''
*sites suspended:
*sites suspended:
**
**

Revision as of 09:45, 8 July 2021

Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Documentation menu: Home Manuals Procedures Training Other Contact For: VO managers Administrators


Back to https://wiki.egi.eu/wiki/Operations_Meeting

General information

Middleware

UMD

  • CentOS8 discussion still ongoing
  • UMD 4 June update
    • ARC 6.12.0 will be included in the upcoming release (end of June)
    • other products to be included: HTcondor, gfal2, lcmaps-plugins, xrootd 5.1.1, StoRM 1.11.21, DDNS probe
  • repository frontend web pages restored as static pages

Preview repository

  • released on 2021-05-20:
    • Preview 2.33.0 (CentOS 7): ARC 6.11.0, STORM 1.11.20 and 1.11.21, VOMS 04-21
  • released on 2021-06-10

Operations

ARGO/SAM

$ python -c 'import htcondor; ad = htcondor.Collector("collector2.opensciencegrid.org:9619").locate(htcondor.DaemonTypes.Schedd, "hosted-ce10.opensciencegrid.org"); print htcondor.SecMan().ping(ad, "READ")["ServerPublicCert"]' | openssl x509 -noout -subject -enddate
subject= /CN=hosted-ce10.opensciencegrid.org
notAfter=Apr 26 12:26:42 2021 GMT

FedCloud

Feedback from DMSU

New Known Error Database (KEDB)

The KEDB has been moved to Jira+Confluence: https://confluence.egi.eu/display/EGIKEDB/EGI+Federation+KEDB+Home

  • problems are tracked with Jira tickets to better follow-up their evoulution
  • problems can be registered by DMSU staff and EGI Operations team

Verify configuration records

On a yearly basis, the information registered into GOC-DB need to be verified. NGIs and RCs have been asked to check them. In particular:

  1. NGI managers should review the people registered and the roles assigned to them, and in particular check the following information:
    • E-Mail
    • ROD E-Mail
    • Security E-Mail
NGI Managers should also review the status of the "not certified" RCs, in according to the RC Status Workflow;
  1. RCs administrators should review the people registered and the roles assigned to them, and in particular check the following information:
    • E-Mail
    • telephone numbers
    • CSIRT E-Mail
RC administrators should also review the information related to the registered service endpoints.

The process should be completed by July 2nd.

List of tickets.

Monthly Availability/Reliability

IPv6 readiness plans

APEL migration from ActiveMQ to ARGO Message Service (AMS)

Prerequisites for using AMS

  • A valid host certificate from an IGTF Accredited CA.
  • A GOCDB 'Site' entry flagged as 'Production'.
  • A GOCDB 'Service' entry of the correct service type flagged as 'Production'. The following service types are used:
    • For Grid accounting use 'gLite-APEL'.
    • For Cloud accounting use 'eu.egi.cloud.accounting'.
    • For Storage accounting use 'eu.egi.storage.accounting'.
  • The 'Host DN' listed in the GOCDB 'Service' entry must exactly match the certificate DN of the host used for accounting. Make sure there are no leading or trailing spaces in the 'Host DN' field.

AOB

Next meeting

Aug