Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "EGI-InSPIRE:SA1.4-QR4"

From EGIWiki
Jump to navigation Jump to search
Line 35: Line 35:
Note. This is a detailed account of progress over the previous quarter of activities within  the  task.  
Note. This is a detailed account of progress over the previous quarter of activities within  the  task.  
-->
-->
GOCDB has experienced problems during quarter three caused by bad database hardware. On January 27th new instance was deployed and all tools were requested to validate the test instance. As the validation was successful migration of GOCDB to new hardware was performed on February 2nd 2011.


Deployment plans of NGI instances of individual operational tools were finalized. Relevant ticket ([https://rt.egi.eu/rt/Ticket/Display.html?id=831 RT #831]) is closed.
Deployment plans of NGI instances of individual operational tools were finalized. Relevant ticket ([https://rt.egi.eu/rt/Ticket/Display.html?id=831 RT #831]) is closed.

Revision as of 00:06, 27 April 2011

1. Task Meetings

There are no specific SA1.4 meetings. It was agreed to discuss all deployment issues with operational tool representatives at the JRA1 meetings. Below is the list of JRA1 meetings and subjects relevant for SA1.4 which were discussed.

Date (dd/mm/yyyy) Url Indico Agenda Title Outcome
17/02/2011 https://www.egi.eu/indico/conferenceDisplay.py?confId=352 InSPIRE-JRA1 phone conf SAM Update-09 release analysis and SAM DMSU activity setup.
16/03/2011 https://www.egi.eu/indico/conferenceDisplay.py?confId=427 InSPIRE-JRA1 phone conf Operational tools updates deployment.
28/04/2010 https://www.egi.eu/indico/conferenceDisplay.py?confId=426 InSPIRE-JRA1 phone conf Handling of nodes in non-production state.

2. Main Achievements

GOCDB has experienced problems during quarter three caused by bad database hardware. On January 27th new instance was deployed and all tools were requested to validate the test instance. As the validation was successful migration of GOCDB to new hardware was performed on February 2nd 2011.

Deployment plans of NGI instances of individual operational tools were finalized. Relevant ticket (RT #831) is closed.

Decommission of gridops.org domain was postponed due to external dependencies (i.e. Top BDII). Decommission of gridops.org domain was rescheduled for June 30th 2011.

Two new version of Operations portal were released in this quarter: 2.4 on November 17th and 2.4.1 on December 16th. Detailed list of new features can be found in JRA1 section. At the end of the quarter there were four NGI instances: NGI_BY, NGI_CZ, NGI_GRNET and NGI_IBERGRID.

SAM/Nagios deployment of NGI instances continued. Two big ROCs finalized migration to NGI instances:

  • Asia Pacific: validated on February 17th 2011
  • Italy: validated on April 4th 2011

At the end of the quarter following SAM/Nagios instances were in production:

  • 24 NGI instances covering 35 EGI partners
  • 3 ROC instances covering 4 EGI partners
  • 1 project instances covering 1 EGI partners
  • 3 external ROC instances covering the following regions: Canada, IGALC and LA.

Detailed list of SAM/Nagios instances can be found on the following page: SAM Instances.

Two procedures relevant for operational tools were approved at the Operations Management Board on March 15th 2011:

3. Issues and Mitigation

Issue Description Mitigation Description
High availability of central operational tools is needed. GOCDB: dynamic loadbalancing DNS setup is provided for the address goc.egi.eu, secondary instance will be set up in Fraunhofer institute in the next quarter.
SAM: April release of SAM will contain option to install secondary instance, this will be deployed based on depending on NGI size and resources.
Operations, accounting portal and metrics portal: services are deployed on virtualization platforms, backups performed regularily, recovery in case of failure can be performed quickly.

4. Plans for the next period

Central MyEGI instance will reach production quality.

Decommission of the old CIC portal (cic.egi.eu) will be performed between April and June 2011 depending on development of the new Operations portal. The main remaining functionalities which need to be migrated to Operations Portal are broadcast and VO ID cards.

Remaining procedures and manuals related to operational tools will be finalized and presented for approval at the OMB in the next quarter.

Contribute and follow discussions of the new task force on regionalization. Update deployment plans of individual NGI instances of tools which will provide regionalized versions in the following period.

Track deployment and validation of remaining regional and NGI Nagioses. Deployment plans of the remaining NGIs are the following:

  • UK and Ireland plan to perform NGI creation in the next quarter.

Track development of probes for monitoring operational tools and integration into ops-monitor Nagios instance.

Track and perform planned tests of failover configurations of centralized tools.