Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Agenda-04-07-2011"

From EGIWiki
Jump to navigation Jump to search
Line 36: Line 36:


===== 1.3.3 EMI1 - UMD1<br>  =====
===== 1.3.3 EMI1 - UMD1<br>  =====
*27 products are in the UMDStore area, which means that staged rollout has been performed, and they will be in the UMD1 release.
*The products missing (at the time of this meeting) and under staged rollout, are:&nbsp;arc-ce, arc-clients and cream (from EMI update 2)
*We are now in the process of preparing the release:&nbsp;collect release notes, issues found in verification and staged rollout, workarounds, etc..


==== 1.4 Interoperability (Michaela)  ====
==== 1.4 Interoperability (Michaela)  ====

Revision as of 10:53, 4 July 2011

Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security



Detailed agenda: Grid Operations Meeting 04 July 2011 14h00 Amsterdam time

1. Middleware releases and staged rollout

1.1 EMI-1 release status (Cristina)

Slides from Cristina

  • EMI Update 2 23.06.2011
    • CREAM&CEMon v. 1.13.1
  • EMI Update 3: 07.07.2011
    • Storm SE (First release in EMI) v. 1.7.0
    • L&B v. 3.0.12
    • glite-proxyrenewal v. 1.3.21
    • glite-MPI v. 1.0.1
    • UNICORE UVOS v. 1.4.2

1.2. EMI/UMD current status

1.3. Staged Rollout (Mario)

1.3.1 gLite 3.1 series
  • WMS 3.2.17: installed and in production, waiting for the staged rollout report
1.3.2 gLite 3.2 series
  • gLexec: No EA has answered the request yet for staged rollout. (This update is needed to bring in new versions of the LCMAPS-plugins-pep-c and PEP-API.)
1.3.3 EMI1 - UMD1
  • 27 products are in the UMDStore area, which means that staged rollout has been performed, and they will be in the UMD1 release.
  • The products missing (at the time of this meeting) and under staged rollout, are: arc-ce, arc-clients and cream (from EMI update 2)
  • We are now in the process of preparing the release: collect release notes, issues found in verification and staged rollout, workarounds, etc..

1.4 Interoperability (Michaela)

UNICORE
Globus
  • Last meeting was https://www.egi.eu/indico/conferenceDisplay.py?confId=496
  • Reminder to all NGIs to tell their sites to register all their Globus GT5 services in GOCDB, since this is a good time now with the upcoming SAM/Nagios release.
  • IGE will officially take over support for Nagios probes, details to be fixed.
  • LRZ will be an EA for Globus.
  • Looking for a future staged-rollout manager.
  • Globus/IGE people now also in EMI ComputeAccounting working group.
  • Next meeting second week of July.
  • Further information: Globus integration task force
ARC

Major problems in operations since this weekend due to waterfloding of NBI computerhall in Copenhagen infecting most NorduGrid infrastructure (GIIS, Mail, SVN, Download) except WWW. GIIS not working effects BDII services. Services went totally down from Saturday evening until Sunday afternoon. Emergency diesel power flooded as well. Some services still effected now: The one of the four global GIIS servers in Denmark and e.g NDGF-T1 mail server is also still down. Possible effect on all sites under http://www.nordugrid.org/monitor/ ARC-CEs in Copenhagen killed. d-Cache Pools in Denmark still kept alive. Most other ARC workernodes free and working fine, but no new jobs coming in. Weatherforcast for Denmark still bad after this worst Thunderstorm in history.

2. Operational Issues

2.1 Publishing site information in BDII

Most of the site in the EGI integrated infrastructure are correctly publishing SiteOtherInfo : GRID=EGI. There are still site that are publishing GRID=EGEE and the Resource infrastructure Provider name as EGEE_ROC instead of EGI_NGI:

GlueSiteOtherInfo: GRID=EGEE
GlueSiteOtherInfo: EGEE_SERVICE=prod
GlueSiteOtherInfo: EGEE_ROC=XXX

Should Be:

GlueSiteOtherInfo: EGEE_SERVICE=prod
GlueSiteOtherInfo: EGI_NGI=XXX
GlueSiteOtherInfo: GRID=EGI

The EGEE_ROC has to be always replaced by EGI_NGI. Sites that are publishing both GRID=EGEE andGRID=EGI should remove the first attribute.

3. AOB

3.1

Next Meeting: