Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "EGI-InSPIRE:Sa1 2012-10-17"

From EGIWiki
Jump to navigation Jump to search
Line 50: Line 50:
=SA1.4 Central tools=  
=SA1.4 Central tools=  
<!--E. Imamagic -->
<!--E. Imamagic -->
No report received so far
*By the end of Tuesday 23 NGIs have deployed SAM Update-17
*Issue with Update-17 and SL6 WN is still being investigated: https://tomtools.cern.ch/jira/browse/SAM-2999
*Help with MW monitoring instance setup
*Participation at SAM workshop at CERN
*Discussion between EMI PT about open actions and future plans for Message Broker Network.
*False alarms raised in Dashboard by SAM-Nagios instance in Beijing: https://ggus.eu/ws/ticket_info.php?ticket=87276


= SA1.5 Accounting =
= SA1.5 Accounting =

Revision as of 17:42, 16 October 2012

Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


SA1 weekly report

Progress of SA1 issues

(SA1) Integration of Albania: the site managers who attended training at TF12 promised progress in the coming months through the creation of the first Albanian site

Milestones/Deliverables

  • D4.6 Operations Architecture: incorporated all changes requested through the received reviews. Incorporating last changes after the moderator's review received today
  • D4.7: laying down structure of the document. Collection of new input received through the OMB survey after TF12

SA1.1 Activity Management

  • assessment of status and progress of COD tickets for unsupported gLite 3.1/3.2 services
  • assessment of problems with ARC-CE tests
  • assessment of status of EMI 1/2 WN testing and distribution of information to relevant lists
  • collection of information of service deployment by VO and discipline
  • ops VO membership management
  • follow up of a SAM issue with the SL6 worker nodes
  • update of the responses analysis for the operations sustainability survey
  • meetings: MAPPER phone conference, GlobusOnLine internal meeting, UCB meeting, FedSM meeting, GDB (full day), WLCG MB, FedCloud (chaired), WLCG Fed ID kickoff meeting

SA1.2 Security

  • ongoing tuning of security dashboard and security nagios to improve the tool for the followup of sites deploying unsupported gLite 3.1/3.2 software (tests added recently to track the deployment of lcg-ce and CREAM instances that could not found with existing scripts)
  • sites reported to be affected by CRITICAL errors are being ticketed by COD
  • starting to see some fallout from the middleware version monitoring in the form of questions from sites, otherwise things are quiet
  • discussion of duties and effort for running of incident response

SA1.3 Staged rollout

  • Release of UMD 2.2.0 on the 9 October.
  • Staged rollout of several components towards UMD 1.9.0 on the 29 October, the preliminary list:
    • ARC 1.1.1 (all components), CREAM 1.13.5 (1.13.4 from EMI1), WMS 3.3.8, BDII-core 1.4.0, GFAL/lcg_utils 1.13.0, IGE Gridway 5.10.2, IGE-SAGA 1.6.1
    • Some of the previous components are already in UMDStore area, and the other are under staged rollout some with reports already delivered.
  • Preparation of SW provisioning of EMI2 components towards UMD 2.3.0 around the middle of November
    • AMGA 2.3.0: staged rollout has finished and will be passed to UMDStore area closer to the freeze date.
    • FTS 2.2.8 and EMIR 1.2.0 are under verification
    • CREAM 1.14.1, dCache 2.2.4, UNICORE/X6 5.0.1 and UNICORE HILA 2.3.0 to start verification this week
  • http://www.lip.pt/computing/apps/EGI_EA/index.php?option=1 will be used to get the metrics for the QR for SA1.3

SA1.3 Integration

  • status assessment of MAPPER integration and proposal of ticket workflows involving EGI and PRACE helpdesk
  • organization of EGI/EUDAT/PRACE workshop on data management use cases

SA1.4 Central tools

SA1.5 Accounting

  • Several sites republishing old data with UserDNs.
  • Temporary glitch with portal corrupted the tier2 topology and hence the monthly Tier2 report to WLCG. Seems to fix itself.
  • NGI_CH started publishing in production from two ARC sites via SGAS..
  • A new site in OSG highlighted a sub-optimal synchronisation with REBUS in the new publishing route.

SA1.6 Helpdesk

  • Shopping list meeting to prioritise requests for GGUS on Wed, Oct. 10th
  • Preparing next release on 2012-10-24
  • discussion on SNOW helpdesk (CERN) closing tickets in case of no user reply for 15 days. EGI has currently no policy like this in place.

SA1.7 Support

  • published newsletter
  • generated tickets for obsolete s/w
  • regular ticket handling, (unknown,a/r, RPI and top-bdii)
  • preparation COD F2F
  • preparation COD-COO phone conf

Software Support

No report received so far

Network Support

Nothing to report.

SA1.8 Availability and core services

  • followup of underperforming sites, new automated procedure for followup of underperforming sites

Documentation

  • restructuring of the operations wiki space
  • ongoing work on EGI OLA
  • ongoing work on RC certyfication procedure: adding comments from UNICORE and Globus

Meetings

  • PRACE/EGI/community meeting on data management (November/beginning of December)