Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

EGI Core activities:2013-bidding Operations Portal

From EGIWiki
Jump to navigation Jump to search


Go back to the activity list.

  • Service name:
  • Service category: Operations
  • Service type: Coordination, operation and maintenance

The Operations Portal provides VO management functions and other capabilities which support the daily operations of EGI.

Introduction

EGI.eu provides a central portal for the operations community that offers a bundle of different capabilities, such as the broadcast tool, VO management facilities, a security dashboard and an operations dashboard that is used to display information about failing monitoring probes and to open tickets to the Resource Centres affected. The dashboard also supports the central grid oversight activities. It is fully interfaced with the EGI Helpdesk and the monitoring system through messaging. It is a critical component as it is used by all EGI Operations Centres to provide support to the respective Resource Centres. The Operations Portal provides tools supporting the daily running of operations of the entire infrastructure: grid oversight, security operations, VO management, broadcast, availability reporting.

Technical description

The Operations Portal provides different capabilities:

  • The detection and the follow-up of incidents on the different sites of the EGI infrastructure
  • The repository for the static information related to Virtual Organizations
  • The broadcast tool
  • A visualisation (charts) and notification (emails or rss) system related to the downtimes impacting the services, the sites, the NGIs or the VO
  • A reporting and computing system giving the availabilities and reliabilities of the TOP-BDII services, of the sites and of the services of a VO
  • A user tracking tool
  • Metrics and charts

The architecture is composed of three modules:

  • A database – to store information related to the users or the VO - namely MySQL
  • A web module – graphical user interface – which is currently integrated into the Symfony and bootstrap frameworks
  • A Data Aggregation and Unification Service named Lavoisier

Lavoisier is the component used to store, consolidate and “feed” data into the web application. This module provides information from various sources without the portal being directly dependent on those information sources thanks to a caching mechanism. This indeed protects us from intermittent failures of information sources.

This portal has been conceived and built as an integration platform of different and heterogeneous sources of information. Lavoisier is used to integrate and harmonized these different data sources.

The different components are integrated in a high available mode:

  • the Mysql Database is integrated in a cluster service;
  • the web module is also integrated in a cluster service;
  • the configuration of Lavoisier is stored in a subversion repository and the service is easily deployable on the fly in case of failure;
  • Different instances of the database, web module and Lavoisier are deployed.

This service includes the following components.

Coordination

This activity is responsible of the coordination of the system operation and upgrade activities with those partners that are in charge of operating other systems that depend on it.

Support

Support through the EGI helpdesk to EGI.eu, VO Managers, EGI CSIRT, Resource Centre and NGI operators for the usage of the various functional modules provided by the tool.

Support hours: eight hours a day (9-17 CE(S)T), Monday to Friday – excluding public holidays of the hosting organization.

Operation

  • Daily running of the system
  • Provisioning of a high availability configuration
  • A test infrastructure to verify interoperability and the impact of software upgrades on depending systems

Maintenance

This activity includes:

  • core refactoring, bug fixing, proactive maintenance, improvement of the system
  • coordination of software maintenance activities with other technology providers that provide software for the EGI Core Infrastructure or remote systems deployed by integrated and peer infrastructures that interoperate with the Operations Portal.
  • Maintenance of probes to test the functionality of the service
  • Requirements gathering

Service level targets