Difference between revisions of "PROC05 Validation of Operations Centre Nagios"
Revision as of 12:06, 21 May 2013
Title | Validation of an Operations Centre Nagios |
Document link | https://twiki.cern.ch/twiki/bin/view/EGEE/ValidateROCNagios |
Last modified | 1.0 |
Policy Group Acronym | OMB |
Policy Group Name | Operations Management Board |
Contact Group | Emir Imamagic |
Document Status | Approved |
Approved Date | 10 August 2010 |
Procedure Statement | The purpose of this document is to define the validation procedure for an Operations Centre Nagios |
Owner | Owner of procedure |
Registration:
- add the Nagios instance to GOCDB and to the service group NGI_XX_SERVICES
- open a ticket for SAM/Nagios to add it to http://gridops.cern.ch/config/nagios-roles.conf
- open a ticket for the Operations Portal team to enable receiving alarms from it
Validation:
- the instance must pass all tests on the ops-monitor instance
- all internal tests (those assigned to the Nagios server itself) must be OK
- ROD must check the status of services and should not see any errors that cannot be attributed to an actual site error
- all of the above should hold for at least 7 days, so that the local MyEGI data can be confirmed to be correct
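
The validation criteria above can be read as a simple decision rule. The following is a minimal sketch in Python, not part of the procedure itself: the `CheckResult` type, the `is_validated` function and the `justified` flag are hypothetical names introduced for illustration, and the sketch assumes that test results have already been collected by some other means (it does not query Nagios, ops-monitor or MyEGI).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Minimum observation period taken from the procedure ("at least 7 days").
MIN_OBSERVATION = timedelta(days=7)


@dataclass
class CheckResult:
    """One test result (hypothetical structure, for illustration only)."""
    name: str
    status: str              # Nagios state: "OK", "WARNING", "CRITICAL" or "UNKNOWN"
    justified: bool = False  # True if ROD attributed the error to a real site problem


def is_validated(ops_monitor, internal, rod_view, started, now):
    """Return True only if all four validation criteria hold."""
    # 1. The instance passes all tests on the ops-monitor instance.
    ops_ok = all(c.status == "OK" for c in ops_monitor)
    # 2. All internal tests assigned to the Nagios server are OK.
    internal_ok = all(c.status == "OK" for c in internal)
    # 3. ROD sees no errors except those justified by an actual site error.
    rod_ok = all(c.status == "OK" or c.justified for c in rod_view)
    # 4. The instance has been observed for at least 7 days.
    long_enough = (now - started) >= MIN_OBSERVATION
    return ops_ok and internal_ok and rod_ok and long_enough


if __name__ == "__main__":
    start = datetime(2013, 5, 21)
    passing = [CheckResult("example-test", "OK")]
    site_error = [CheckResult("example-test", "CRITICAL", justified=True)]
    print(is_validated(passing, passing, site_error, start, start + timedelta(days=7)))
```

In this sketch a justified error seen by ROD does not block validation, whereas any failure on ops-monitor or in the internal tests does; whether a real deployment should be this strict is a judgment for the operations team.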