Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @


From EGIWiki
Jump to navigation Jump to search
Main operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security

Inspire reports menu: Home SA1 weekly Reports SA1 Task QR Reports NGI QR Reports NGI QR User support Reports

1. Task Meetings

Date (dd/mm/yyyy) Url Indico Agenda Title Outcome
26/11/2012 InSPIRE-JRA1 phone conf Tools status update
17/12/2012 InSPIRE-JRA1 phone conf Tools status update

2. Main Achievements

Assessment of progress in 2012. and plans for 2013. provided by all product teams.

Two upgrades of ActiveMQ brokers performed in November: 7th to version 5.5.1-fuse-08-15 and 20th to version ActiveMQ 5.5.1-fuse-09-16. A complete changelog can be found at the following address:

Operations portal Prototype 3.0.0 was deployed on December 18th: All interested parties were requested to provide feedback.

Since the current version of Operations portal provides NGI view regional instances will be decommissioned. Currently there is only one instance left: NGI_GRNET.

SAM Update-19 staged rollout successfully finished and released to production on November 23th. SAM Update-20 staged rollout was finished in January, but release to production is still pending WLCG validation. Ireland NGI was decommissioned in this quarter. ROC_IGALC was decommissioned, remaining sites were moved to ROC_LA. At the end of the quarter following SAM/Nagios instances were in production:

  • 28 NGI instances covering 38 EGI partners
  • 3 ROC instances covering 3 EGI partners
  • 2 external ROC instances covering the following regions: Canada and LA.

Detailed list of SAM/Nagios instances can be found on the following page: SAM Instances.

Integration of the ops-monitor SAM instance with the central ACE was finalized in this quarter. All tests are documented on the following page: OPS-MONITOR_profile_SAM_tests.

New centralized SAM instance for monitoring middleware versions was deployed: All tests ran by this instance are documented on the following page: MW_Nagios_tests. The instance was integrated with the Operations Portal and the following tests ran by this instance were added to operational tests:

  • eu.egi.sec.DPM - checks if SRM DPM service endpoint is using gLite 3.2 middleware.
  • eu.egi.sec.LFC - checks if LFC service endpoint is using gLite 3.2 middleware.
  • eu.egi.sec.WN - checks if WN is using gLite 3.2 (or older) middleware.
  • org.nagios.GLUE2-Check - checks if the site BDII is publishing GLUE2 information.

New centralized SAM instance for monitoring cloud resources was deployed: Integration of cloud resources with all operational tools is documented on the following page: Fedcloud-tf:WorkGroups:Scenario5.

3. Issues and Mitigation

Issue Description Mitigation Description

4. Plans for the next period

APEL SSM will be rolled into production in the following quarter. In order to enable sites to publish accounting data securely to production message broker network gLite-APEL nodes need to be authorized. Component for retrieving list of gLite-APELs DNs from GOCDB will be implemented and deployed on production message brokers.

A new version of Operations Portal containing availabilities/reliabilities reporting system from perspective of VOs will be deployed in production.

Finalize decommission of the remaining NGI instance of Operational Portal (NGI_GRNET).

The next version of SAM will integrate Nagios probes provided by EMI. Once released to production all NGI instances will need to be reinstalled since upgrade from gLite UI to EMI is not supported.

Development of probes for two centralized SAM instances for middleware monitoring and cloud monitoring will continue. Cloud monitoring instance will be moved to domain (