The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other platforms; new updates will be ignored and lost.
If needed, you can get in touch with the EGI SDIS team using operations @ egi.eu.

Difference between revisions of "EGI-InSPIRE:Sa1 2012-11-28"

(35 intermediate revisions by 13 users not shown)
{{Template:Op menubar}}
{{Template:EGI-Inspire menubar}}
{{TOC_right}}
[[Category:SA1 weekly report]]
SA1 weekly report


=Progress of SA1 issues=
{{Template:Inspire_reports_menubar}}
<!-- T. Ferrari -->
{{TOC_right}}




=Milestones/Deliverables=
<!-- T. Ferrari -->
* D4.7 Operations Sustainability


=SA1.1 Activity Management=  
=Progress of SA1 issues= <!-- T. Ferrari -->  
<!-- S. Burke, T. Ferrari,  M. Krakowian, P. Solagna -->


=SA1.2 Security=
Nothing new to report.  
<!-- D. Kelsey -->


= SA1.3 Staged rollout =
=Milestones/Deliverables= <!-- T. Ferrari -->


=SA1.3 Integration=
*D4.7 Operations Sustainability. Still in draft status, will be ready for review next Monday.
<!-- E. Imamagic, M. Krakowian from Nov 2012 -->


=SA1.1 Activity Management= <!-- S. Burke, T. Ferrari,  M. Krakowian, P. Solagna -->


=SA1.4 Central tools=
MEETINGS
<!--E. Imamagic -->


=SA1.5 Accounting=
*JRA1 meeting
<!--J. Gordon-->
*organization and contribution to the EGI/EUDAT/PRACE workshop
*security meeting


=SA1.6 Helpdesk=
ACTIVITIES
<!-- T. Antoni -->


*contribution to the definition of a new VO A/R calculation algorithm
*discussion of problem of NGI A/R statistics, which are currently affected by sites that are in production just for a fraction of the calendar month
*Follow up of a GOCDB outage
*Follow up of message broker network outage affecting all NGIs (November A/R statistics will be recomputed)
*OPS VO management
*Submission of two abstracts for ISGC 2013
*coordination of the plan for retirement of unsupported software, and discussion of a new plan and of the related development actions needed for decommissioning of gLite 3.2 WN, DPM, LFC and EMI 1. One third of the affected services were retired or upgraded in the last 2 weeks. Discussion of an action plan to check the progress of upgrade plans by end of November
*contribution of ideas for the preparation of the EGI Champion programme


=SA1.7 Support=
=SA1.2 Security= <!-- D. Kelsey -->  
<!-- trompert -->


== Software Support ==
*monthly CSIRT team meeting held on 22 November
<!-- A Krenek -->
*activities for tracking upgrading status from gLite 3.2 components
*Including handling potential suspension of unresponsive sites - not needed in the end - all went into downtime
*SVG - handling of several new issues
*planning for security workshop and submission of 2 talks to ISGC 2013
*development work on new custom security probes for NGI SAM instance
*'''no''' sites were eventually suspended because of the lack of feedback about upgrade plans
 
= SA1.3 Staged rollout  =
 
*EMI1 update 21 and EMI2 update 6 have entered into the SW provisioning process
**Some components already under verification (LB 3.2.9 from EMI1, and gridsite).
**UMD urgent release foreseen for some components, at least WN and DPM
*SAM19 completed SW provisioning and released into production.
*CA 1.51 completed SW provisioning and is expected to be released this week
*New changes to the SW provisioning process are under discussion
 
=SA1.3 Integration= <!-- E. Imamagic, M. Krakowian from Nov 2012 -->
 
*Participation in the ENVRI use case meeting
*discussion of MAPPER helpdesk strategy and new actions
*Organization of the EGI/EUDAT/PRACE meeting, with the contribution of 6 user communities
 
=SA1.4 Central tools= <!--E. Imamagic -->
 
*Upgrade of AUTH and SRCE brokers finished successfully, CERN brokers will be upgraded tomorrow (https://operations-portal.egi.eu/broadcast/archive/id/817).
*SAM Update-19 released on November 23rd
*Prolonged downtime of GOCDB because of power issues; the failover configuration did not prove ready to replace the primary instance affected by the problem. Action on EGI operations to discuss a better failover strategy for GOCDB.
*APEL downtime caused by issues with the power supply system of the hosting organization
 
= SA1.5 Accounting =
 
'''<!--A. Packer--> Repository ''' - outage last Tuesday, 20th November, due to a problem caused by electrical work carried out on the UPS at RAL; all APEL systems were restored on Wednesday, 21st November. Sites continue to raise tickets in order to republish user DNs.
 
'''Portal'''
The new Fomalhaut release of the Accounting Portal is now available at
http://accounting.egi.eu.
 
There are several changes in this version:
* Improved UserDN country classification patterns.
* Improvements on usage by country.
* GET interface for CSV
* Support new RFC 2253 UserDNs.
* Better support for custom VOs.
* UserDN NGI attribution
* Support for local jobs; there are three options, selectable on most views:
** Only Grid jobs (default).
** Grid+Local jobs - in case there is a corresponding global VO, both are aggregated.
** Only Local jobs.
* Many fixes and optimizations.
 
There are also many important changes and new views on the usage reporting
between NGIs and countries that are on hold pending approval of the EGI Usage
VT. Interested parties can visit http://accounting-devel.egi.eu to check them.
 
=SA1.6 Helpdesk= <!-- T. Antoni -->
 
*Shopping list meeting to prioritise requests for GGUS
*Preparing the GGUS-AB phone conference next Thursday (2012-11-29), see agenda: https://indico.egi.eu/indico/conferenceDisplay.py?confId=1259
*Preparing the next GGUS release on 2012-11-28
 
=SA1.7 Support= <!--  trompert -->
 
*usual ticket handling and dashboard work
*handling unsupported middleware tickets
*review of GOCdb business logic
*wrote plan for future COD activities
 
== Software Support == <!-- A Krenek -->
No report received


== Network Support  ==


=SA1.8 Availability and core services=
Started assessment of IPv6 compliance of FTS 3.
<!--C. Kanellopoulos-->
 
=SA1.8 Availability and core services= <!--C. Kanellopoulos-->
 
* Ongoing communications relevant to dteam VO migration to EMI VOMS service endpoint
* Initialized dummy "test" VO and subgroups on EMI VOMS instance to perform tests
* 3 issues identified, related to the fact that user registration cannot be handled on VOMS in the same way as on VOMRS
* Handling of tickets in SLM support unit
* Re-uploaded final A/R reports for October - expecting newer version this week as there were issues with the one uploaded last week.
* Operation of dteam VO VOMRS service endpoint
 
*Revision of A/R algorithm for the computation of NGI availability/reliability statistics
*assessment of migration plan from VOMRS to VOMS and impact on DTEAM VO management duties. Upgrade date was postponed.
 
== Documentation == <!-- M. Krakowian -->  


*[[EGI Operations Start Guide|EGI Operations Start Guide]] for newcomers was created.
*[[PROC04|PROC04]] was reviewed and split into two pages: [[PROC04|PROC04]] and [[Availability and reliability monthly statistics|Availability and reliability monthly statistics]]
*[[Administrator Documentation|Documentation page for administrators]] was created


== Documentation ==
= Meetings= <!--all-->  
<!-- M. Krakowian -->


= Meetings=
*TERENA Network Operations Task Force meeting, Poznan, 12-13 Dec 2012
<!--all-->

Latest revision as of 17:26, 6 January 2015





Progress of SA1 issues

Nothing new to report.

Milestones/Deliverables

  • D4.7 Operations Sustainability. Still in draft status, will be ready for review next Monday.

SA1.1 Activity Management

MEETINGS

  • JRA1 meeting
  • organization and contribution to the EGI/EUDAT/PRACE workshop
  • security meeting

ACTIVITIES

  • contribution to the definition of a new VO A/R calculation algorithm
  • discussion of problem of NGI A/R statistics, which are currently affected by sites that are in production just for a fraction of the calendar month
  • Follow up of a GOCDB outage
  • Follow up of message broker network outage affecting all NGIs (November A/R statistics will be recomputed)
  • OPS VO management
  • Submission of two abstracts for ISGC 2013
  • coordination of the plan for retirement of unsupported software, and discussion of a new plan and of the related development actions needed for decommissioning of gLite 3.2 WN, DPM, LFC and EMI 1. One third of the affected services were retired or upgraded in the last 2 weeks. Discussion of an action plan to check the progress of upgrade plans by end of November
  • contribution of ideas for the preparation of the EGI Champion programme
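The report does not describe the A/R algorithm under discussion, so the following is only a hypothetical illustration of why sites that are in production for a fraction of the month can distort an NGI-level figure. It assumes, purely for the example, that site availability is hours up divided by hours in production, and compares a plain average over sites with an average weighted by hours in production. The class, function names and numbers are invented for the sketch and are not taken from any EGI tool.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SiteMonth:
    """Hypothetical monthly record for one site (not an EGI data format)."""
    name: str
    hours_in_production: float  # hours the site was in production this month
    hours_up: float             # hours it was measured as available


def site_availability(site: SiteMonth) -> float:
    # Assumed definition for this sketch: uptime over time in production.
    return site.hours_up / site.hours_in_production


def unweighted_ngi_availability(sites: List[SiteMonth]) -> float:
    # Every site counts equally, however briefly it was in production.
    return sum(site_availability(s) for s in sites) / len(sites)


def weighted_ngi_availability(sites: List[SiteMonth]) -> float:
    # Each site contributes in proportion to its hours in production.
    total_up = sum(s.hours_up for s in sites)
    total_production = sum(s.hours_in_production for s in sites)
    return total_up / total_production


# Invented numbers: one site in production all month (30 days = 720 hours),
# one site that entered production only for the last 48 hours.
sites = [
    SiteMonth("site-a", hours_in_production=720.0, hours_up=700.0),
    SiteMonth("site-b", hours_in_production=48.0, hours_up=24.0),
]

print(f"unweighted: {unweighted_ngi_availability(sites):.4f}")
print(f"weighted:   {weighted_ngi_availability(sites):.4f}")
```

With these invented inputs the plain average is pulled down to roughly 0.74 by the short-lived site, while the weighted figure stays near 0.94. Whether weighting of this kind was the option actually considered is not stated in the report.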

SA1.2 Security

  • monthly CSIRT team meeting held on 22 November
  • activities for tracking upgrading status from gLite 3.2 components
  • Including handling potential suspension of unresponsive sites - not needed in the end - all went into downtime
  • SVG - handling of several new issues
  • planning for security workshop and submission of 2 talks to ISGC 2013
  • development work on new custom security probes for NGI SAM instance
  • no sites were eventually suspended because of the lack of feedback about upgrade plans

SA1.3 Staged rollout

  • EMI1 update 21 and EMI2 update 6 have entered into the SW provisioning process
    • Some components already under verification (LB 3.2.9 from EMI1, and gridsite).
    • UMD urgent release foreseen for some components, at least WN and DPM
  • SAM19 completed SW provisioning and released into production.
  • CA 1.51 completed SW provisioning and is expected to be released this week
  • New changes to the SW provisioning process are under discussion

SA1.3 Integration

  • Participation in the ENVRI use case meeting
  • discussion of MAPPER helpdesk strategy and new actions
  • Organization of the EGI/EUDAT/PRACE meeting, with the contribution of 6 user communities

SA1.4 Central tools

  • Upgrade of AUTH and SRCE brokers finished successfully, CERN brokers will be upgraded tomorrow (https://operations-portal.egi.eu/broadcast/archive/id/817).
  • SAM Update-19 released on November 23rd
  • Prolonged downtime of GOCDB because of power issues; the failover configuration did not prove ready to replace the primary instance affected by the problem. Action on EGI operations to discuss a better failover strategy for GOCDB.
  • APEL downtime caused by issues with the power supply system of the hosting organization

SA1.5 Accounting

Repository - outage last Tuesday, 20th November, due to a problem caused by electrical work carried out on the UPS at RAL; all APEL systems were restored on Wednesday, 21st November. Sites continue to raise tickets in order to republish user DNs.

Portal - the new Fomalhaut release of the Accounting Portal is now available at http://accounting.egi.eu.

There are several changes in this version:

  • Improved UserDN country classification patterns.
  • Improvements on usage by country.
  • GET interface for CSV
  • Support new RFC 2253 UserDNs.
  • Better support for custom VOs.
  • UserDN NGI attribution
  • Support for local jobs; there are three options, selectable on most views:
    • Only Grid jobs (default).
    • Grid+Local jobs - in case there is a corresponding global VO, both are aggregated.
    • Only Local jobs.
  • Many fixes and optimizations.
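The item on RFC 2253 UserDNs in the list above can be illustrated with a small sketch. It assumes that the older DNs are written in the slash-separated one-line form (for example /DC=org/DC=example/CN=Jane Doe); the report does not say this, and the function names below are hypothetical, not part of the Accounting Portal. RFC 2253 writes the most specific component first, separates components with commas, and backslash-escapes a small set of special characters.

```python
import re

# Characters that RFC 2253 requires to be backslash-escaped inside a value.
_SPECIAL = ',+"\\<>;'


def escape_value(value: str) -> str:
    """Escape one attribute value following the RFC 2253 rules."""
    chars = ["\\" + ch if ch in _SPECIAL else ch for ch in value]
    # A leading '#' or space must be escaped.
    if chars and chars[0] in ("#", " "):
        chars[0] = "\\" + chars[0]
    # A trailing space must be escaped as well.
    if len(chars) > 1 and chars[-1] == " ":
        chars[-1] = "\\ "
    return "".join(chars)


def slash_dn_to_rfc2253(dn: str) -> str:
    """Convert '/TYPE=value/TYPE=value' into an RFC 2253 style string.

    Hypothetical helper for illustration only.
    """
    if not dn.startswith("/"):
        raise ValueError("expected a DN starting with '/'")
    # Split only on a '/' that begins a new 'TYPE=' component, so values that
    # contain '/' (for example 'CN=host/node.example.org') stay intact.
    parts = re.split(r"/(?=[A-Za-z][A-Za-z0-9.-]*=)", dn[1:])
    rdns = []
    for part in parts:
        attr, sep, value = part.partition("=")
        if not sep or not attr:
            raise ValueError(f"malformed DN component: {part!r}")
        rdns.append(f"{attr}={escape_value(value)}")
    # RFC 2253 lists the most specific component first, so reverse the order.
    return ",".join(reversed(rdns))


print(slash_dn_to_rfc2253("/DC=org/DC=example/CN=Jane Doe"))
```

The sketch ignores multi-valued components and non-ASCII values, which a real implementation would also need to handle.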

There are also many important changes and new views on the usage reporting between NGIs and countries that are on hold pending approval of the EGI Usage VT. Interested parties can visit http://accounting-devel.egi.eu to check them.

SA1.6 Helpdesk

SA1.7 Support

  • usual ticket handling and dashboard work
  • handling unsupported middleware tickets
  • review of GOCdb business logic
  • wrote plan for future COD activities

Software Support

No report received

Network Support

Started assessment of IPv6 compliance of FTS 3.

SA1.8 Availability and core services

  • Ongoing communications relevant to dteam VO migration to EMI VOMS service endpoint
  • Initialized dummy "test" VO and subgroups on EMI VOMS instance to perform tests
  • 3 issues identified, related to the fact that user registration cannot be handled on VOMS in the same way as on VOMRS
  • Handling of tickets in SLM support unit
  • Re-uploaded final A/R reports for October - expecting newer version this week as there were issues with the one uploaded last week.
  • Operation of dteam VO VOMRS service endpoint
  • Revision of A/R algorithm for the computation of NGI availability/reliability statistics
  • assessment of migration plan from VOMRS to VOMS and impact on DTEAM VO management duties. Upgrade date was postponed.

Documentation

Meetings

  • TERENA Network Operations Task Force meeting, Poznan, 12-13 Dec 2012