Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "EGI-InSPIRE:Sa1 2012-11-28"

From EGIWiki
Jump to navigation Jump to search
Line 1: Line 1:
{{Template:Op menubar}}
{{Template:Op menubar}} {{TOC_right}}[[Inspire-sa1/weekly-reports|<< SA1 weekly reports]]  
{{TOC_right}}
[[Category:SA1 weekly report]]
[[Inspire-sa1/weekly-reports|<< SA1 weekly reports]]


=Progress of SA1 issues=  
=Progress of SA1 issues= <!-- T. Ferrari --> Nothing new to report.  
<!-- T. Ferrari -->
Nothing new to report.


=Milestones/Deliverables=
=Milestones/Deliverables= <!-- T. Ferrari -->  
<!-- T. Ferrari -->
* D4.7 Operations Sustainability. Still in draft status, will be ready for review next Monday.


=SA1.1 Activity Management=
*D4.7 Operations Sustainability. Still in draft status, will be ready for review next Monday.
<!-- S. Burke, T. Ferrari,  M. Krakowian, P. Solagna -->


MEETINGS
=SA1.1 Activity Management= <!-- S. Burke, T. Ferrari,  M. Krakowian, P. Solagna -->
* JRA1 meeting
* organization and contribution to the EGI/EUDAT/PRACE workshop
* security meeting


ACTIVITIES
MEETINGS
* contribution to the defininition of a new VO A/R calculation algorithm
* discussion of problem of NGI A/R statistics, which are currently affected by sites that are in production just for a fraction of the calendar month
* Follow up of a GOCDB outage
* Follow up of message broker network outage affecting all NGIs (November A/R statistics will be recomputed)
* OPS VO management
* Submission of two abstracts for ISGC 2013
* coordination of plan for retirement of unsupported software, and discussion of a new plan and of the related development actions needed, for decommissioning of gLite 3.2 WN, DPM, LFC and EMI 1. 1/3 of the affected services retired/upgraded in the last 2 weeks. Discussion of an action plan to check the progress of upgrade plans by end of November
* contribution of ideas for the preparation of the EGI Champion programme


=SA1.2 Security=
*JRA1 meeting  
<!-- D. Kelsey -->
*organization and contribution to the EGI/EUDAT/PRACE workshop  
* monthly CSIRT team meeting held on 22 November
*security meeting
* activities for tracking upgrading status from gLite 3.2 components
* Including handling potential suspension of unresponsive sites - not needed in the end - all went into downtime
* SVG - handling of several new issues
* planning for security workshop and submission of 2 talks to ISGC 2013
* development work on new custom security probes for NGI SAM instance
* '''no''' sites were eventually suspended because of the lack of feedback about upgrade plans


= SA1.3 Staged rollout =
ACTIVITIES
*EMI1 update 21 and EMI2 update 6 have entered into the SW provisioning process
 
**Some components already under verification (LB 3.2.9 from EMI1, and gridsite).
*contribution to the defininition of a new VO A/R calculation algorithm
**UMD urgent release foreseen for some components, at least WN and DPM
*discussion of problem of NGI A/R statistics, which are currently affected by sites that are in production just for a fraction of the calendar month
*SAM19 completed SW provisioning and released into production.
*Follow up of a GOCDB outage
*CA 1.51 completed SW provisioning expected to be released this week
*Follow up of message broker network outage affecting all NGIs (November A/R statistics will be recomputed)
*OPS VO management
*Submission of two abstracts for ISGC 2013
*coordination of plan for retirement of unsupported software, and discussion of a new plan and of the related development actions needed, for decommissioning of gLite 3.2 WN, DPM, LFC and EMI 1. 1/3 of the affected services retired/upgraded in the last 2 weeks. Discussion of an action plan to check the progress of upgrade plans by end of November
*contribution of ideas for the preparation of the EGI Champion programme
 
=SA1.2 Security= <!-- D. Kelsey -->
 
*monthly CSIRT team meeting held on 22 November
*activities for tracking upgrading status from gLite 3.2 components
*Including handling potential suspension of unresponsive sites - not needed in the end - all went into downtime
*SVG - handling of several new issues
*planning for security workshop and submission of 2 talks to ISGC 2013
*development work on new custom security probes for NGI SAM instance
*'''no''' sites were eventually suspended because of the lack of feedback about upgrade plans
 
= SA1.3 Staged rollout =
 
*EMI1 update 21 and EMI2 update 6 have entered into the SW provisioning process  
**Some components already under verification (LB 3.2.9 from EMI1, and gridsite).  
**UMD urgent release foreseen for some components, at least WN and DPM  
*SAM19 completed SW provisioning and released into production.  
*CA 1.51 completed SW provisioning expected to be released this week  
*New changes into the SW provisioning process under discussion
*New changes into the SW provisioning process under discussion


=SA1.3 Integration=
=SA1.3 Integration= <!-- E. Imamagic, M. Krakowian from Nov 2012 -->  
<!-- E. Imamagic, M. Krakowian from Nov 2012 -->
 
* Participation to the envri use case meeting
*Participation to the envri use case meeting  
* discussion of MAPPER helpdesk strategy and new actions
*discussion of MAPPER helpdesk strategy and new actions  
* Organization of the EGI/EUDAT/PRACE meeting, with the contribution of 6 user communities
*Organization of the EGI/EUDAT/PRACE meeting, with the contribution of 6 user communities
 
=SA1.4 Central tools= <!--E. Imamagic -->
 
*Upgrade of AUTH and SRCE brokers finished successfully, CERN brokers will be upgraded tomorrow (https://operations-portal.egi.eu/broadcast/archive/id/817).
*SAM Update-19 released on November 23rd
*prolonged downtime of GOCDB because of power issues, the failover configuration did not prove to be ready to replace the primary instance affected by the problem. Action on EGI operations to discuss a better failover strategy for GOCDB.
*APEL downtime caused by issues of the power supply system of the hosting organization
 
= SA1.5 Accounting =


=SA1.4 Central tools=
'''<!--A. Packer--> Repository ''' - outage last Tuesday, 20th November due to problem caused by electrical work carried out on the UPS at RAL, all APEL systems restored on Wednesday, 21st November. Sites continue to raise tickets in order to republish user DNs.
<!--E. Imamagic -->


* Upgrade of AUTH and SRCE brokers finished successfully, CERN brokers will be upgraded tomorrow (https://operations-portal.egi.eu/broadcast/archive/id/817).
=SA1.6 Helpdesk= <!-- T. Antoni -->
* SAM Update-19 released on November 23rd
* prolonged downtime of GOCDB because of power issues, the failover configuration did not prove to be ready to replace the primary instance affected by the problem. Action on EGI operations to discuss a better failover strategy for GOCDB.
* APEL downtime caused by issues of the power supply system of the hosting organization


=SA1.5 Accounting=
*Shopping list meeting to prioritise requests for GGUS
'''<!--A. Packer--> Repository '''
*Preparing the GGUS-AB phone conference next Thursday (2012-11-29), see agenda: https://indico.egi.eu/indico/conferenceDisplay.py?confId=1259
- outage last Tuesday, 20th November due to problem caused by electrical work carried out on the UPS at RAL, all APEL systems restored on Wednesday, 21st November.  Sites continue to raise tickets in order to republish user DNs.
*Preparing the next GGUS release on 2012-11-28


=SA1.6 Helpdesk=  
=SA1.7 Support= <!-- trompert -->  
<!-- T. Antoni -->
* Shopping list meeting to prioritise requests for GGUS
* Preparing the GGUS-AB phone conference next Thursday (2012-11-29), see agenda: https://indico.egi.eu/indico/conferenceDisplay.py?confId=1259
* Preparing the next GGUS release on 2012-11-28


=SA1.7 Support=
*usual ticket handling and dashboard work  
<!--  trompert -->
*handling unsupported middleware tickets  
* usual ticket handling and dashboard work
*review of GOCdb business logic  
* handling unsupported middleware tickets
*wrote plan for future COD activities
* review of GOCdb business logic
* wrote plan for future COD activities


== Software Support ==
== Software Support == <!-- A Krenek --> No report received  
<!-- A Krenek -->
No report received


== Network Support  ==
== Network Support  ==


Started assessment of IPv6 compliance of FTS 3.
Started assessment of IPv6 compliance of FTS 3.  
 
=SA1.8 Availability and core services= <!--C. Kanellopoulos-->
 
* Ongoing communications relevant to dteam VO migration to EMI VOMS service endpoint
* Initialized dummy "test" VO and subgroups on EMI VOMS instance to perform tests
* 3 issues identified related to how registration of users cannot be handled in the same way on VOMS as it was on VOMRS
* Handling of tickets in SLM support unit
* Re-uploaded final A/R reports for October - expecting newer version this week as there were issues with the one uploaded last week.
* Operation of dteam VO VOMRS service endpoint
 
*Revision of A/R algorithm for the computation of NGI availability/reliability statistics
*assessment of migration plan from VOMRS to VOMS and impact on DTEAM VO management duties. Upgrade date was postponed.
 
== Documentation == <!-- M. Krakowian -->
 
*[[EGI Operations Start Guide|EGI Operations Start Guide]] for newcomers was created.
*[[PROC04|PROC04]] was reviewed and split into two pages [[PROC04|PROC04]] and [[Availability and reliability monthly statistics|Availability and reliability monthly statistics]]
*[[Administrator Documentation|Documentation page for administrators]] was created


=SA1.8 Availability and core services=
= Meetings= <!--all-->  
<!--C. Kanellopoulos-->
* Revision of A/R algorithm for the computation of NGI availability/reliability statistics
* assessment of migration plan from VOMRS to VOMS and impact on DTEAM VO management duties. Upgrade date was postponed.


== Documentation ==
*TERENA Network Operations Task Force meeting, Poznan, 12-13 Dec 2012
<!-- M. Krakowian -->
* [[EGI_Operations_Start_Guide | EGI Operations Start Guide]] for newcomers was created.
* [[PROC04| PROC04]] was reviewed and split into two pages [[PROC04|PROC04]] and [[Availability_and_reliability_monthly_statistics| Availability and reliability monthly statistics]]
* [[Administrator_Documentation|Documentation page for administrators]] was created


= Meetings=
[[Category:SA1_weekly_report]]
<!--all-->
* TERENA Network Operations Task Force meeting, Poznan, 12-13 Dec 2012

Revision as of 09:39, 28 November 2012

Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


<< SA1 weekly reports

=Progress of SA1 issues= Nothing new to report.

Milestones/Deliverables

  • D4.7 Operations Sustainability. Still in draft status, will be ready for review next Monday.

SA1.1 Activity Management

MEETINGS

  • JRA1 meeting
  • organization and contribution to the EGI/EUDAT/PRACE workshop
  • security meeting

ACTIVITIES

  • contribution to the defininition of a new VO A/R calculation algorithm
  • discussion of problem of NGI A/R statistics, which are currently affected by sites that are in production just for a fraction of the calendar month
  • Follow up of a GOCDB outage
  • Follow up of message broker network outage affecting all NGIs (November A/R statistics will be recomputed)
  • OPS VO management
  • Submission of two abstracts for ISGC 2013
  • coordination of plan for retirement of unsupported software, and discussion of a new plan and of the related development actions needed, for decommissioning of gLite 3.2 WN, DPM, LFC and EMI 1. 1/3 of the affected services retired/upgraded in the last 2 weeks. Discussion of an action plan to check the progress of upgrade plans by end of November
  • contribution of ideas for the preparation of the EGI Champion programme

SA1.2 Security

  • monthly CSIRT team meeting held on 22 November
  • activities for tracking upgrading status from gLite 3.2 components
  • Including handling potential suspension of unresponsive sites - not needed in the end - all went into downtime
  • SVG - handling of several new issues
  • planning for security workshop and submission of 2 talks to ISGC 2013
  • development work on new custom security probes for NGI SAM instance
  • no sites were eventually suspended because of the lack of feedback about upgrade plans

SA1.3 Staged rollout

  • EMI1 update 21 and EMI2 update 6 have entered into the SW provisioning process
    • Some components already under verification (LB 3.2.9 from EMI1, and gridsite).
    • UMD urgent release foreseen for some components, at least WN and DPM
  • SAM19 completed SW provisioning and released into production.
  • CA 1.51 completed SW provisioning expected to be released this week
  • New changes into the SW provisioning process under discussion

SA1.3 Integration

  • Participation to the envri use case meeting
  • discussion of MAPPER helpdesk strategy and new actions
  • Organization of the EGI/EUDAT/PRACE meeting, with the contribution of 6 user communities

SA1.4 Central tools

  • Upgrade of AUTH and SRCE brokers finished successfully, CERN brokers will be upgraded tomorrow (https://operations-portal.egi.eu/broadcast/archive/id/817).
  • SAM Update-19 released on November 23rd
  • prolonged downtime of GOCDB because of power issues, the failover configuration did not prove to be ready to replace the primary instance affected by the problem. Action on EGI operations to discuss a better failover strategy for GOCDB.
  • APEL downtime caused by issues of the power supply system of the hosting organization

SA1.5 Accounting

Repository - outage last Tuesday, 20th November due to problem caused by electrical work carried out on the UPS at RAL, all APEL systems restored on Wednesday, 21st November. Sites continue to raise tickets in order to republish user DNs.

SA1.6 Helpdesk

SA1.7 Support

  • usual ticket handling and dashboard work
  • handling unsupported middleware tickets
  • review of GOCdb business logic
  • wrote plan for future COD activities

== Software Support == No report received

Network Support

Started assessment of IPv6 compliance of FTS 3.

SA1.8 Availability and core services

  • Ongoing communications relevant to dteam VO migration to EMI VOMS service endpoint
  • Initialized dummy "test" VO and subgroups on EMI VOMS instance to perform tests
  • 3 issues identified related to how registration of users cannot be handled in the same way on VOMS as it was on VOMRS
  • Handling of tickets in SLM support unit
  • Re-uploaded final A/R reports for October - expecting newer version this week as there were issues with the one uploaded last week.
  • Operation of dteam VO VOMRS service endpoint
  • Revision of A/R algorithm for the computation of NGI availability/reliability statistics
  • assessment of migration plan from VOMRS to VOMS and impact on DTEAM VO management duties. Upgrade date was postponed.

Documentation

Meetings

  • TERENA Network Operations Task Force meeting, Poznan, 12-13 Dec 2012