
Agenda-04-07-2011

From EGIWiki
Revision as of 10:48, 4 July 2011 by Psolagna (talk | contribs)



Detailed agenda: Grid Operations Meeting 04 July 2011 14h00 Amsterdam time

1. Middleware releases and staged rollout

1.1 EMI-1 release status (Cristina)

Slides from Cristina

  • EMI Update 2: 23.06.2011
    • CREAM&CEMon v. 1.13.1
  • EMI Update 3: 07.07.2011
    • StoRM SE (first release in EMI) v. 1.7.0
    • L&B v. 3.0.12
    • glite-proxyrenewal v. 1.3.21
    • glite-MPI v. 1.0.1
    • UNICORE UVOS v. 1.4.2

1.2 EMI/UMD current status

1.3 Staged Rollout (Mario)

1.3.1 gLite 3.1 series
  • WMS 3.2.17: installed and in production, waiting for the staged rollout report
1.3.2 gLite 3.2 series
1.3.3 EMI1 - UMD1

1.4 Interoperability (Michaela)

UNICORE
Globus
  • Last meeting was https://www.egi.eu/indico/conferenceDisplay.py?confId=496
  • Reminder to all NGIs: ask your sites to register all their Globus GT5 services in GOCDB now, ahead of the upcoming SAM/Nagios release.
  • IGE will officially take over support for Nagios probes, details to be fixed.
  • LRZ will be an EA for Globus.
  • Looking for a future staged-rollout manager.
  • Globus/IGE people now also in EMI ComputeAccounting working group.
  • Next meeting second week of July.
  • Further information: Globus integration task force
ARC

Major operational problems since this weekend: flooding of the NBI computer hall in Copenhagen has affected most of the NorduGrid infrastructure (GIIS, mail, SVN, download), with the exception of WWW. With GIIS not working, BDII services are affected. Services were completely down from Saturday evening until Sunday afternoon; the emergency diesel power supply was flooded as well. Some services are still affected: one of the four global GIIS servers (the one in Denmark) and, for example, the NDGF-T1 mail server are still down. All sites listed under http://www.nordugrid.org/monitor/ may be affected. The ARC-CEs in Copenhagen are down, while the dCache pools in Denmark have been kept alive. Most other ARC worker nodes are free and working fine, but no new jobs are coming in. The weather forecast for Denmark remains bad after the worst thunderstorm in history.

2. Operational Issues

2.1 Publishing site information in BDII

Most of the sites in the EGI integrated infrastructure are correctly publishing GlueSiteOtherInfo: GRID=EGI. Some sites are still publishing GRID=EGEE and give the Resource infrastructure Provider name as EGEE_ROC instead of EGI_NGI:

GlueSiteOtherInfo: GRID=EGEE
GlueSiteOtherInfo: EGEE_SERVICE=prod
GlueSiteOtherInfo: EGEE_ROC=XXX

Should be:

GlueSiteOtherInfo: EGEE_SERVICE=prod
GlueSiteOtherInfo: EGI_NGI=XXX
GlueSiteOtherInfo: GRID=EGI

EGEE_ROC must always be replaced by EGI_NGI. Sites that are publishing both GRID=EGEE and GRID=EGI should remove the first attribute.
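
The rule above can be sketched as a small check that a site administrator could run over the GlueSiteOtherInfo lines of an LDIF dump of the site entry. This is only an illustration under stated assumptions: the function names (parse_site_other_info, check_site_other_info, migrate_site_other_info) are hypothetical and not part of any EGI or BDII tool, the input is assumed to be plain "GlueSiteOtherInfo: KEY=VALUE" lines as in the examples above, and base64-encoded LDIF values are not handled.

```python
def parse_site_other_info(ldif_text):
    """Collect GlueSiteOtherInfo values as (key, value) pairs, in order."""
    pairs = []
    for line in ldif_text.splitlines():
        name, sep, value = line.partition(":")
        if not sep or name.strip() != "GlueSiteOtherInfo":
            continue  # not a GlueSiteOtherInfo attribute line
        key, eq, val = value.strip().partition("=")
        if eq:
            pairs.append((key.strip(), val.strip()))
    return pairs


def check_site_other_info(pairs):
    """Return a list of human-readable problems; empty if the site is fine."""
    problems = []
    grids = [v for k, v in pairs if k == "GRID"]
    keys = {k for k, _ in pairs}
    if "EGEE" in grids:
        problems.append("GRID=EGEE is still published; remove it")
    if "EGI" not in grids:
        problems.append("GRID=EGI is missing")
    if "EGEE_ROC" in keys:
        problems.append("EGEE_ROC is still published; replace it with EGI_NGI")
    if "EGI_NGI" not in keys:
        problems.append("EGI_NGI is missing")
    return problems


def migrate_site_other_info(pairs):
    """Apply the renaming rule: drop GRID=EGEE, rename EGEE_ROC to EGI_NGI."""
    migrated = []
    for key, value in pairs:
        if key == "GRID" and value == "EGEE":
            continue  # dropped; GRID=EGI is appended below if absent
        if key == "EGEE_ROC":
            key = "EGI_NGI"
        if (key, value) not in migrated:
            migrated.append((key, value))
    if ("GRID", "EGI") not in migrated:
        migrated.append(("GRID", "EGI"))
    return migrated


old = """GlueSiteOtherInfo: GRID=EGEE
GlueSiteOtherInfo: EGEE_SERVICE=prod
GlueSiteOtherInfo: EGEE_ROC=XXX"""

pairs = parse_site_other_info(old)
for problem in check_site_other_info(pairs):
    print(problem)
for key, value in migrate_site_other_info(pairs):
    print(f"GlueSiteOtherInfo: {key}={value}")
```

Run on the outdated example, the sketch reports the old attributes and prints the three lines in the corrected form. It only produces the text to publish; how the values are actually changed depends on the site's own information provider configuration, which is not covered here.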

3. AOB

3.1

Next Meeting:

