Difference between revisions of "EGI-InSPIRE:Sa1 2012-10-24"

From EGIWiki
 
(15 intermediate revisions by 5 users not shown)
Line 1: Line 1:
{{Template:Op menubar}} {{TOC_right}} SA1 weekly report
+
{{Template:EGI-Inspire menubar}}
 +
 
 +
{{Template:Inspire_reports_menubar}}
 +
{{TOC_right}}  
  
 
=Progress of SA1 issues= <!-- T. Ferrari -->  
 
=Progress of SA1 issues= <!-- T. Ferrari -->  
Line 7: Line 10:
 
=Milestones/Deliverables= <!-- T. Ferrari -->  
 
=Milestones/Deliverables= <!-- T. Ferrari -->  
  
*D4.6 Operations Architecture:
+
*D4.6 Operations Architecture: external review completed
 +
*D4.7 Operations sustainability: no progress last week
  
 
=SA1.1 Activity Management= <!-- S. Burke, T. Ferrari,  M. Krakowian, P. Solagna -->  
 
=SA1.1 Activity Management= <!-- S. Burke, T. Ferrari,  M. Krakowian, P. Solagna -->  
 +
* meetings:
 +
** GDB for discussion of suspension policy related to the deployment of unsupported glite 3.1/3.2 services
 +
** ISGC PC phone call
 +
** fedSM weekly call
 +
** COD meeting
 +
** PRACE/EUDAT/EGI workshop organization meeting
 +
** EGI Operations meeting, chaired and agenda preparation
 +
** Fedclouds task force
 +
** COD/CSIRT meeting to evaluate the status of unsupported middleware monitoring process
 +
** COD meeting to evaluate status of current activities, and promote new ones: revision of current and future probes, support to emerging NGIs and new site administrators, technical revision of automated certification procedure (in progress)
  
*meetings:
+
The main activity of the last 7 days was the coordination of activities around the decommissioning of unsupported software, the follow-up of missing/incomplete probes, and the procedures for ticketing sites and related escalation
**GDB for discussion of suspension policy related to the deployment of unsupported glite 3.1/3.2 services
+
* Follow-up of suspension cases for low availability
**ISGC PC phone call
+
* Revision of operations service portfolio following gSLM guidelines (preparatory work for D4.7 deliverable on service sustainability)
**fedSM weekly call
+
* ops VO membership management
**COD meeting
+
* DTEAM VO membership management
**PRACE/EUDAT/EGI workshop organization meeting
+
* operations ticket handling, handling of various A/R escalations
*follow-up of suspension cases for low availability  
+
* Produced assessment on the VO usage of deployed gLite services
*revision of operations service portfolio following gSLM guidelines (preparatory work for D4.7 deliverable on service sustainability)
+
* Follow up of sites/NGIs with missing security contacts
 +
* Preparation of OMB agenda
  
 
=SA1.2 Security= <!-- D. Kelsey -->  
 
=SA1.2 Security= <!-- D. Kelsey -->  
Line 40: Line 55:
  
 
*collection of feedback from GLOBUS TF about site certification procedure
 
*collection of feedback from GLOBUS TF about site certification procedure
 +
*planning of workshop on use cases for data management across peer infrastructures
  
 
=SA1.4 Central tools= <!--E. Imamagic -->  
 
=SA1.4 Central tools= <!--E. Imamagic -->  
  
*By the end of Tuesday 26 NGIs have deployed SAM Update-17  
+
*By the end of Tuesday 26 NGIs have deployed SAM Update-17, assessment of status of deployment in order to allow CUSTOM services in GOCDB
 
*Help with MW monitoring instance setup  
 
*Help with MW monitoring instance setup  
 
*Meeting between EMI Messaging PT and broker administrators scheduled for Wednesday  
 
*Meeting between EMI Messaging PT and broker administrators scheduled for Wednesday  
 
*Solved: problem of nagios binary and SL6 WNs (https://tomtools.cern.ch/jira/browse/SAM-2999)  
 
*Solved: problem of nagios binary and SL6 WNs (https://tomtools.cern.ch/jira/browse/SAM-2999)  
 
*Solved: problem with false alarms raised in Dashboard by SAM-Nagios instance in Beijing (https://ggus.eu/ws/ticket_info.php?ticket=87276)  
 
*Solved: problem with false alarms raised in Dashboard by SAM-Nagios instance in Beijing (https://ggus.eu/ws/ticket_info.php?ticket=87276)  
*Messaging: A new version of the message broker service is ready to be deployed in the PROD
+
*Messaging: A new version of the message broker service is ready to be deployed in the PROD network. This will be an update from ActiveMQ 5.5.1-fuse-05-01 to ActiveMQ 5.5.1-fuse-08-15. A complete changelog can be found at: http://fusesource.com/wiki/display/ProdInfo/FUSE+Message+Broker+v5.5.1-fuse+Release+Notes
  
network. This will be an update from: ActiveMQ 5.5.1-fuse-05-01 To: ActiveMQ 5.5.1-fuse-08-15
+
=SA1.5 Accounting= <!--J. Gordon-->
  
A complete changelog can be found at: http://fusesource.com/wiki/display/ProdInfo/FUSE+Message+Broker+v5.5.1-fuse+Release+Notes
+
Worked with a number of sites to publish UserDNs.  
  
=SA1.5 Accounting= <!--J. Gordon-->
+
NDGF now publishing from SGAS&nbsp;via SSM&nbsp;in production.  
  
<br> =SA1.6 Helpdesk= <!-- T. Antoni -->  
+
Final testing of DGAS&nbsp;with SSM.
 +
 
 +
=SA1.6 Helpdesk= <!-- T. Antoni -->
  
 
*revision of ticket escalation procedures in case of tickets without activity  
 
*revision of ticket escalation procedures in case of tickets without activity  
Line 62: Line 80:
 
*preparing the first meeting of the GGUS advisory board
 
*preparing the first meeting of the GGUS advisory board
  
=SA1.7 Support= <!--  trompert -->  
+
=SA1.7 Support=  
 +
<!--  trompert -->  
 +
*published newsletter
 +
*usual ticket handling and dashboard work
 +
*cod meeting
 +
*prepare cod f2f
  
<br> == Software Support == <!-- A Krenek --> Working smoothly, no specific issues to report.  
+
== Software Support ==  
 +
<!-- A Krenek --> Working smoothly, no specific issues to report.  
  
 
{| class="wikitable"
 
{| class="wikitable"
Line 126: Line 150:
  
 
*PRACE/EGI/community meeting on data management (October/beginning of November)
 
*PRACE/EGI/community meeting on data management (October/beginning of November)
 
[[Category:SA1_weekly_report]]
 

Latest revision as of 17:47, 6 January 2015




Progress of SA1 issues

  • (SA1) Integration of Albania: the site administrators participated in the TF12 training and promised progress in the set-up of a grid site
  • (SA1) PQ8 Grid software maintenance and support: waiting for TCB discussions

Milestones/Deliverables

  • D4.6 Operations Architecture: external review completed
  • D4.7 Operations sustainability: no progress last week

SA1.1 Activity Management

  • meetings:
    • GDB for discussion of suspension policy related to the deployment of unsupported glite 3.1/3.2 services
    • ISGC PC phone call
    • fedSM weekly call
    • COD meeting
    • PRACE/EUDAT/EGI workshop organization meeting
    • EGI Operations meeting, chaired and agenda preparation
    • Fedclouds task force
    • COD/CSIRT meeting to evaluate the status of unsupported middleware monitoring process
    • COD meeting to evaluate status of current activities, and promote new ones: revision of current and future probes, support to emerging NGIs and new site administrators, technical revision of automated certification procedure (in progress)

The main activity of the last 7 days was the coordination of activities around the decommissioning of unsupported software, the follow-up of missing/incomplete probes, and the procedures for ticketing sites and related escalation

  • Follow-up of suspension cases for low availability
  • Revision of operations service portfolio following gSLM guidelines (preparatory work for D4.7 deliverable on service sustainability)
  • ops VO membership management
  • DTEAM VO membership management
  • operations ticket handling, handling of various A/R escalations
  • Produced assessment on the VO usage of deployed gLite services
  • Follow up of sites/NGIs with missing security contacts
  • Preparation of OMB agenda

SA1.2 Security

  • Continued to provide the infrastructure to monitor obsolete gLite middleware and participated in several discussions with operations and COD people
  • Chasing sites/tickets related to obsolete gLite middleware
  • Still chasing a few sites that have not yet upgraded WMS to address vulnerability #4073
  • TI accreditation process completed, with draft version of RFC 2350 (Expectations for Computer Security Incident Response) submitted. Now awaiting response.
  • Start planning for a security training workshop at ISGC2013
  • Ongoing discussions about IRTF staffing/funding
  • Presentation of HEP federated identity management pilot project at HEPiX meeting
  • Presentation on Grid Security Policies, Procedures and activities at Shonan meeting

SA1.3 Staged rollout

  • UMD 1.9.0 release freeze, includes: IGE SAGA, IGE Gridway, BDII Core, CREAM, GFAL/lcg_utils, WMS, ARC
  • Continuation of SW provisioning towards UMD 2.3.0
  • Grid Operations meeting: 22 October

SA1.3 Integration

  • collection of feedback from GLOBUS TF about site certification procedure
  • planning of workshop on use cases for data management across peer infrastructures

SA1.4 Central tools

SA1.5 Accounting

Worked with a number of sites to publish UserDNs.

NDGF now publishing from SGAS via SSM in production.

Final testing of DGAS with SSM.

SA1.6 Helpdesk

  • revision of ticket escalation procedures in case of tickets without activity
  • finalize the next GGUS release 2012-10-24
  • preparing the first meeting of the GGUS advisory board

SA1.7 Support

  • published newsletter
  • usual ticket handling and dashboard work
  • cod meeting
  • prepare cod f2f

Software Support

Working smoothly, no specific issues to report. 

DMSU tickets flow, Oct 14 -- Oct 20

  • assigned: 16
  • back to tpm: 0
  • reassigned to 3rd level: 12
  • solved: 3

Open DMSU tickets status

  • assigned: 0
  • in progress: 1
  • waiting for reply: 4
  • on hold: 2

Network Support

  • Installed new LFC catalogue machine on gridsrv6-net.dir.garr.it for IPv6 testing
  • Resumed CREAM CE testing and reporting

SA1.8 Availability and core services

  • revision of EGI.eu OLA (Malgorzata) according to the structure of the operations service catalog; organization of meetings with technical providers to discuss and agree on the suggested service targets
  • DTEAM VOMRS migration in progress
  • DTEAM VO management duties
  • installed new VOMS instance (from UMD2 release) to test VOMRS migration procedure
  • published final A/R reports for September
  • follow up on A/R recomputation requests
  • discussion of recomputations of A/R statistics for September for all sites affected by the problem:

https://tomtools.cern.ch/jira/browse/SAM-2999


Documentation

  • ongoing work on the operations wiki space
  • ongoing work on EGI OLA

Meetings

  • PRACE/EGI/community meeting on data management (October/beginning of November)