Difference between revisions of "EGI-InSPIRE:Sa1 2012-10-17"
Jump to navigation
Jump to search
(22 intermediate revisions by 10 users not shown) | |||
Line 1: | Line 1: | ||
{{Template: | {{Template:EGI-Inspire menubar}} | ||
{{Template:Inspire_reports_menubar}} | |||
{{TOC_right}} | |||
=Progress of SA1 issues= | =Progress of SA1 issues= | ||
<!-- T. Ferrari --> | <!-- T. Ferrari --> | ||
Line 10: | Line 9: | ||
=Milestones/Deliverables= | =Milestones/Deliverables= | ||
<!-- T. Ferrari --> | <!-- T. Ferrari --> | ||
* D4.6 Operations Architecture: incorporated all changes requested through the received reviews. | * D4.6 Operations Architecture: incorporated all changes requested through the received reviews. Incorporating last changes after the moderator's review received today | ||
* D4.7: laying down structure of the document. Collection of new input received through the OMB survey after TF12 | * D4.7: laying down structure of the document. Collection of new input received through the OMB survey after TF12 | ||
Line 19: | Line 18: | ||
* assessment of status of EMI 1/2 WN testing and distribution of information to relevant lists | * assessment of status of EMI 1/2 WN testing and distribution of information to relevant lists | ||
* collection of information of service deployment by VO and discipline | * collection of information of service deployment by VO and discipline | ||
* meetings: MAPPER phone conference, GlobusOnLine internal meeting, UCB meeting, FedSM meeting, GDB (full day), WLCG MB, FedCloud | * ops VO membership management | ||
* follow up of a SAM issue with the SL6 worker nodes | |||
* update of the responses analysis for the operations sustainability survey | |||
* meetings: MAPPER phone conference, GlobusOnLine internal meeting, UCB meeting, FedSM meeting, GDB (full day), WLCG MB, FedCloud (chaired), WLCG Fed ID kickoff meeting | |||
=SA1.2 Security= | =SA1.2 Security= | ||
Line 25: | Line 27: | ||
* ongoing tuning of security dashboard and security nagios to improve the tool for the followup of sites deploying unsupported gLite 3.1/3.2 software (tests added recently to track the deployment of lcg-ce and CREAM instances that could not found with existing scripts) | * ongoing tuning of security dashboard and security nagios to improve the tool for the followup of sites deploying unsupported gLite 3.1/3.2 software (tests added recently to track the deployment of lcg-ce and CREAM instances that could not found with existing scripts) | ||
* sites reported to be affected by CRITICAL errors are being ticketed by COD | * sites reported to be affected by CRITICAL errors are being ticketed by COD | ||
* starting to see some fallout from the middleware version monitoring in the form of questions from sites, otherwise things are quiet | |||
* discussion of duties and effort for running of incident response | * discussion of duties and effort for running of incident response | ||
= SA1.3 Staged rollout = | = SA1.3 Staged rollout = | ||
*Release of UMD 2.2.0 on the 9 October. | |||
*Staged rollout of several components towards UMD 1.9.0 on the 29 October, the preliminary list: | |||
**ARC 1.1.1 (all components), CREAM 1.13.5 (1.13.4 from EMI1), WMS 3.3.8, BDII-core 1.4.0, GFAL/lcg_utils 1.13.0, IGE Gridway 5.10.2, IGE-SAGA 1.6.1 | |||
**Some of the previous components are already in UMDStore area, and the other are under staged rollout some with reports already delivered. | |||
*Preparation of SW provisioning of EMI2 components towards UMD 2.3.0 around the middle of November | |||
**AMGA 2.3.0: staged rollout has finished and will be passed to UMDStore area closer to the freeze date. | |||
**FTS 2.2.8 and EMIR 1.2.0 are under verification | |||
**CREAM 1.14.1, dCache 2.2.4, UNICORE/X6 5.0.1 and UNICORE HILA 2.3.0 to start verification this week | |||
*http://www.lip.pt/computing/apps/EGI_EA/index.php?option=1 will be used to get the metrics for the QR for SA1.3 | |||
=SA1.3 Integration= | =SA1.3 Integration= | ||
Line 36: | Line 49: | ||
=SA1.4 Central tools= | =SA1.4 Central tools= | ||
<!--E. Imamagic --> | <!--E. Imamagic --> | ||
*By the end of Tuesday 23 NGIs have deployed SAM Update-17 | |||
*Issue with Update-17 and SL6 WN is still being investigated: https://tomtools.cern.ch/jira/browse/SAM-2999 | |||
*Help with MW monitoring instance setup | |||
*Participation at SAM workshop at CERN | |||
*Discussion between EMI PT about open actions and future plans for Message Broker Network. | |||
*False alarms raised in Dashboard by SAM-Nagios instance in Beijing: https://ggus.eu/ws/ticket_info.php?ticket=87276 | |||
= SA1.5 Accounting = | = SA1.5 Accounting = | ||
Line 49: | Line 67: | ||
* Shopping list meeting to prioritise requests for GGUS on Wed, Oct. 10th | * Shopping list meeting to prioritise requests for GGUS on Wed, Oct. 10th | ||
* Preparing next release on 2012-10-24 | * Preparing next release on 2012-10-24 | ||
* discussion on SNOW helpdesk (CERN) closing tickets in case of no user reply for 15 days. EGI has currently no policy like this in place. | |||
=SA1.7 Support= | =SA1.7 Support= | ||
<!-- trompert --> | <!-- trompert --> | ||
* published newsletter | |||
* generated tickets for obsolete s/w | |||
* regular ticket handling, (unknown,a/r, RPI and top-bdii) | |||
* preparation COD F2F | |||
* preparation COD-COO phone conf | |||
== Software Support == | == Software Support == | ||
<!-- A Krenek --> | <!-- A Krenek --> | ||
Intentions to deploy Globus Online raise non-trivial questions | |||
regarding the interface to SEs and catalogues, | |||
see [https://ggus.eu/tech/ticket_show.php?ticket=87004 #87004] and | |||
[https://ggus.eu/tech/ticket_show.php?ticket=87005 #87005]. | |||
We need further input on the intentions to come with feasible architecture. | |||
{| class="wikitable" | |||
! DMSU tickets flow Oct 7 -- Oct 13 | |||
|- | |||
|- | |||
| assigned | |||
| 16 | |||
|- | |||
| back to tpm | |||
| 1 | |||
|- | |||
| reassigned to 3rd level | |||
| 12 | |||
|- | |||
| solved | |||
| 5 | |||
|} | |||
{| class="wikitable" | |||
! open DMSU tickets status | |||
|- | |||
| assigned | |||
| 1 | |||
|- | |||
| in progress | |||
| 3 | |||
|- | |||
| waiting for reply | |||
| 4 | |||
|- | |||
| on hold | |||
| 2 | |||
|} | |||
== Network Support == | == Network Support == | ||
Nothing to report. | |||
=SA1.8 Availability and core services= <!--C. Kanellopoulos--> | |||
*followup of underperforming sites, new automated procedure for followup of underperforming site | |||
*follow up on VOMS migration method (from VOMRS to VOMS). Several issues identified with current implementation (encoding problems, roles transitions etc). Migration method is still under development and testing. | |||
*management of A/R recomputation requests for September 2012 | |||
*Discussions on finalization of EGI.eu OLA document | |||
= | == Documentation == <!-- M. Krakowian --> | ||
<!-- | |||
*restructuring of the operations wiki space | |||
*ongoing work on EGI OLA | |||
*ongoing work on RC certyfication procedure: adding comments from UNICORE and Globus | |||
= Meetings= | = Meetings= | ||
<!--all--> | <!--all--> | ||
* PRACE/EGI/community meeting on data management ( | * PRACE/EGI/community meeting on data management (November/beginning of December) |
Latest revision as of 17:45, 6 January 2015
EGI Inspire Main page |
Inspire reports menu: | Home • | SA1 weekly Reports • | SA1 Task QR Reports • | NGI QR Reports • | NGI QR User support Reports |
Progress of SA1 issues
(SA1) Integration of Albania: the site managers who attended training at TF12 promised progress in the coming months through the creation of the first Albanian site
Milestones/Deliverables
- D4.6 Operations Architecture: incorporated all changes requested through the received reviews. Incorporating last changes after the moderator's review received today
- D4.7: laying down structure of the document. Collection of new input received through the OMB survey after TF12
SA1.1 Activity Management
- assessment of status and progress of COD tickets for unsupported gLite 3.1/3.2 services
- assessment of problems with ARC-CE tests
- assessment of status of EMI 1/2 WN testing and distribution of information to relevant lists
- collection of information of service deployment by VO and discipline
- ops VO membership management
- follow up of a SAM issue with the SL6 worker nodes
- update of the responses analysis for the operations sustainability survey
- meetings: MAPPER phone conference, GlobusOnLine internal meeting, UCB meeting, FedSM meeting, GDB (full day), WLCG MB, FedCloud (chaired), WLCG Fed ID kickoff meeting
SA1.2 Security
- ongoing tuning of security dashboard and security nagios to improve the tool for the followup of sites deploying unsupported gLite 3.1/3.2 software (tests added recently to track the deployment of lcg-ce and CREAM instances that could not found with existing scripts)
- sites reported to be affected by CRITICAL errors are being ticketed by COD
- starting to see some fallout from the middleware version monitoring in the form of questions from sites, otherwise things are quiet
- discussion of duties and effort for running of incident response
SA1.3 Staged rollout
- Release of UMD 2.2.0 on the 9 October.
- Staged rollout of several components towards UMD 1.9.0 on the 29 October, the preliminary list:
- ARC 1.1.1 (all components), CREAM 1.13.5 (1.13.4 from EMI1), WMS 3.3.8, BDII-core 1.4.0, GFAL/lcg_utils 1.13.0, IGE Gridway 5.10.2, IGE-SAGA 1.6.1
- Some of the previous components are already in UMDStore area, and the other are under staged rollout some with reports already delivered.
- Preparation of SW provisioning of EMI2 components towards UMD 2.3.0 around the middle of November
- AMGA 2.3.0: staged rollout has finished and will be passed to UMDStore area closer to the freeze date.
- FTS 2.2.8 and EMIR 1.2.0 are under verification
- CREAM 1.14.1, dCache 2.2.4, UNICORE/X6 5.0.1 and UNICORE HILA 2.3.0 to start verification this week
- http://www.lip.pt/computing/apps/EGI_EA/index.php?option=1 will be used to get the metrics for the QR for SA1.3
SA1.3 Integration
- status assessment of MAPPER integration and proposal of ticket workflows involving EGI and PRACE helpdesk
- organization of EGI/EUDAT/PRACE workshop on data management use cases
SA1.4 Central tools
- By the end of Tuesday 23 NGIs have deployed SAM Update-17
- Issue with Update-17 and SL6 WN is still being investigated: https://tomtools.cern.ch/jira/browse/SAM-2999
- Help with MW monitoring instance setup
- Participation at SAM workshop at CERN
- Discussion between EMI PT about open actions and future plans for Message Broker Network.
- False alarms raised in Dashboard by SAM-Nagios instance in Beijing: https://ggus.eu/ws/ticket_info.php?ticket=87276
SA1.5 Accounting
- Several sites republishing old data with UserDNs.
- Temporary glitch with portal corrupted the tier2 topology and hence the monthly Tier2 report to WLCG. Seems to fix itself.
- NGI_CH started publishing in production from two ARC sites via SGAS..
- A new site in OSG highlighted a sub-optimal synchronisation with REBUS in the new publishing route.
SA1.6 Helpdesk
- Shopping list meeting to prioritise requests for GGUS on Wed, Oct. 10th
- Preparing next release on 2012-10-24
- discussion on SNOW helpdesk (CERN) closing tickets in case of no user reply for 15 days. EGI has currently no policy like this in place.
SA1.7 Support
- published newsletter
- generated tickets for obsolete s/w
- regular ticket handling, (unknown,a/r, RPI and top-bdii)
- preparation COD F2F
- preparation COD-COO phone conf
Software Support
Intentions to deploy Globus Online raise non-trivial questions regarding the interface to SEs and catalogues, see #87004 and #87005. We need further input on the intentions to come with feasible architecture.
DMSU tickets flow Oct 7 -- Oct 13 | |
---|---|
assigned | 16 |
back to tpm | 1 |
reassigned to 3rd level | 12 |
solved | 5 |
open DMSU tickets status | |
---|---|
assigned | 1 |
in progress | 3 |
waiting for reply | 4 |
on hold | 2 |
Network Support
Nothing to report.
SA1.8 Availability and core services
- followup of underperforming sites, new automated procedure for followup of underperforming site
- follow up on VOMS migration method (from VOMRS to VOMS). Several issues identified with current implementation (encoding problems, roles transitions etc). Migration method is still under development and testing.
- management of A/R recomputation requests for September 2012
- Discussions on finalization of EGI.eu OLA document
Documentation
- restructuring of the operations wiki space
- ongoing work on EGI OLA
- ongoing work on RC certyfication procedure: adding comments from UNICORE and Globus
Meetings
- PRACE/EGI/community meeting on data management (November/beginning of December)