The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other platforms; new updates will be ignored and lost.
If needed, you can get in touch with the EGI SDIS team at operations @ egi.eu.

Difference between revisions of "Agenda-2021-10-11"

 
(6 intermediate revisions by 2 users not shown)
Line 8: Line 8:
== UMD ==


* no news related to CentOS8
* including EOS in UMD


== Preview repository  ==
Line 15: Line 15:
* released on 2021-08-11
** '''[https://appdb.egi.eu/store/software/preview.repository/releases/2.0/2.35.0/ Preview 2.35.0]''' (CentOS 7): APEL SSM 3.2.1, DPM/DMLite 1.15.0 and 1.15.1, frontier-squid 4.15.2, xrootd 5.3.0
* We plan to stop releasing Preview, since it does not seem to be used very much and it is easier to get the latest versions of the products from EPEL or the product teams' repositories prior to their release in UMD.


= Operations  =
Line 45: Line 46:
*** '''INFN-PISA''': HTCondorCE failures fixed; SRM failures not yet
** NGI_PL: https://ggus.eu/index.php?mode=ticket_info&ticket_id=153659
*** '''TASK''': in the process of replacing QCG with ARC-CE
** NGI_UA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=152841
*** '''UA-NSCMBR''': problem during the DPM update: conflict between xrootd 5 and dmlite 1.13. Unscheduled downtime due to a power failure in the computing centre. An NFS configuration issue affected ARC-CE. Accounting data republished using the ARC accounting functionalities.
Line 62: Line 63:
*** '''SE-SNIC-T2'''
** NGI_UA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=154298
*** '''UA-IFBG''': ARC-CE failures, now fixed.
** ROC_LA: https://ggus.eu/index.php?mode=ticket_info&ticket_id=154292
*** '''AstrogridPUC''': failures on ARC-CE and SRM/webdav; storage element to be retired.




Line 115: Line 116:
** 11 tickets (out of 112) not solved yet
*** '''Australia-ATLAS''' [https://ggus.eu/index.php?mode=ticket_info&ticket_id=152428 152428] and '''Australia-T2''' [https://ggus.eu/index.php?mode=ticket_info&ticket_id=152429 152429]: they still have ARC-CE 5.4; they are moving to a Cloudscheduler-based compute system and will be removing the ARC-CEs in the near future
*** '''CA-SFU-T2''' [https://ggus.eu/index.php?mode=ticket_info&ticket_id=152433 152433]: CEs updated; some errors with the benchmark which seem harmful. Duplicated records for the previous months were cleaned; it was suggested to set `apel_messages = summaries` in the ARC configuration file. Some inconsistencies are still being investigated.
*** '''IN2P3-IPNL''' [https://ggus.eu/index.php?mode=ticket_info&ticket_id=152460 152460]: CE not yet in production
*** '''JP-KEK-CRC-02''' [https://ggus.eu/index.php?mode=ticket_info&ticket_id=152471 152471]: installed new CEs, some authz failures with sending the records...
Line 140: Line 141:


= AOB  =
 
* EGI Conference 18 - 21 Oct 2021: https://indico.egi.eu/event/5464/overview


== Next meeting  ==
Nov

Latest revision as of 12:51, 11 October 2021



Back to https://wiki.egi.eu/wiki/Operations_Meeting

General information

Middleware

UMD

  • including EOS in UMD

Preview repository

  • released on 2021-06-10
  • released on 2021-08-11
    • Preview 2.35.0 (CentOS 7): APEL SSM 3.2.1, DPM/DMLite 1.15.0 and 1.15.1, frontier-squid 4.15.2, xrootd 5.3.0
  • We plan to stop releasing Preview, since it does not seem to be used very much and it is easier to get the latest versions of the products from EPEL or the product teams' repositories prior to their release in UMD.

Operations

ARGO/SAM

FedCloud

Feedback from DMSU

New Known Error Database (KEDB)

The KEDB has been moved to Jira+Confluence: https://confluence.egi.eu/display/EGIKEDB/EGI+Federation+KEDB+Home

  • problems are tracked with Jira tickets to better follow their evolution
  • problems can be registered by DMSU staff and EGI Operations team

Monthly Availability/Reliability


  • sites suspended:

Documentation

IPv6 readiness plans

APEL migration from ActiveMQ to ARGO Message Service (AMS)

Prerequisites for using AMS

  • A valid host certificate from an IGTF Accredited CA.
  • A GOCDB 'Site' entry flagged as 'Production'.
  • A GOCDB 'Service' entry of the correct service type flagged as 'Production'. The following service types are used:
    • For Grid accounting use 'gLite-APEL'.
    • For Cloud accounting use 'eu.egi.cloud.accounting'.
    • For Storage accounting use 'eu.egi.storage.accounting'.
  • The 'Host DN' listed in the GOCDB 'Service' entry must exactly match the certificate DN of the host used for accounting. Make sure there are no leading or trailing spaces in the 'Host DN' field.
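
The last prerequisite (an exact 'Host DN' match with no stray spaces) can be checked before registering the service. The following is a minimal sketch, assuming both DNs have already been obtained as plain strings (for example, copied from the GOCDB 'Service' entry and from the host certificate subject). The function name `check_host_dn` and the example DN are hypothetical and are not part of GOCDB, APEL or AMS.

```python
def check_host_dn(gocdb_host_dn: str, cert_dn: str) -> list:
    """Return a list of problems; an empty list means the DNs match exactly.

    Hypothetical helper: both arguments are plain strings supplied by the
    caller; nothing is fetched from GOCDB or read from a certificate here.
    """
    problems = []
    # Leading or trailing spaces in the GOCDB field break the exact match.
    if gocdb_host_dn != gocdb_host_dn.strip():
        problems.append("GOCDB 'Host DN' has leading or trailing whitespace")
    # The comparison is deliberately strict: no normalisation is applied.
    if gocdb_host_dn != cert_dn:
        problems.append("GOCDB 'Host DN' does not exactly match the certificate DN")
    return problems


if __name__ == "__main__":
    # Example DN is made up for illustration only.
    cert = "/DC=org/DC=example/CN=apel.example.org"
    for issue in check_host_dn(cert + " ", cert):
        print(issue)
```

The comparison is intentionally a plain string equality, since the prerequisite asks for an exact match; any normalisation (case folding, reordering of DN components) would hide the very mismatches this check is meant to catch.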

Monitoring of the accounting data

To allow the publication of accounting data to be monitored, one CE per site needs to be registered as an "APEL" service endpoint.

AOB

Next meeting

Nov