Difference between revisions of "PROC05 Validation of Operations Centre Nagios"
(10 intermediate revisions by 4 users not shown) | |||
Line 1: | Line 1: | ||
{{Template:Op menubar}} {{Template:Doc_menubar}} {{TOC_right}} | {{Template:Op menubar}} {{Template:Doc_menubar}} {{TOC_right}} | ||
[[Category:Deprecated]] | |||
{| style="border:1px solid black; background-color:lightgrey; color: black; padding:5px; font-size:140%; width: 90%; margin: auto;" | |||
| style="padding-right: 15px; padding-left: 15px;" | | |||
|[[File:Alert.png]] This article is '''Deprecated''' and should no longer be used, but is still available for reasons of reference. | |||
|} | |||
{{Ops_procedures | {{Ops_procedures | ||
|Doc_title = Validation of Operations Centre Nagios | |Doc_title = Validation of Operations Centre Nagios | ||
|Doc_link = [[PROC05| https://wiki.egi.eu/wiki/PROC05]] | |Doc_link = [[PROC05| https://wiki.egi.eu/wiki/PROC05]] | ||
|Version = | |Version = 19 August 2014 | ||
|Policy_acronym = OMB | |Policy_acronym = OMB | ||
|Policy_name = Operations Management Board | |Policy_name = Operations Management Board | ||
|Contact_group = | |Contact_group = operations@egi.eu | ||
|Doc_status = Approved | |Doc_status = Approved | ||
|Approval_date = 10 August 2010 | |Approval_date = 10 August 2010 | ||
|Procedure_statement = The purpose of this document is to define validation procedure of Operations Centre Nagios | |Procedure_statement = The purpose of this document is to define validation procedure of Operations Centre Nagios | ||
|Owner = Alessandro Paolini | |||
}} | }} | ||
Line 28: | Line 34: | ||
|- | |- | ||
! <br> | ! <br> | ||
! | ! <br> | ||
! Responsible | ! Responsible | ||
! Action | ! Action | ||
|- valign="top" | |- valign="top" | ||
| 1 | | 1 | ||
| | | <br> | ||
| NGI | | NGI | ||
| Add instance to [https://goc.egi.eu/ GOC DB] and to [[NGI services in GOCDB|service group NGI_XX_SERVICES]] | | Add instance to [https://goc.egi.eu/ GOC DB] and to [[NGI services in GOCDB|service group NGI_XX_SERVICES]] | ||
|- valign="top" | |- valign="top" | ||
| 2 | | 2 | ||
| <br> | |||
| NGI | |||
| Open GGUS ticket in [http://ggus.eu/ GGUS system] for SAM/Nagios SU to add your NAGIOS instance to http://mon.egi.eu/nagios-roles.conf | |||
|- valign="top" | |||
| 3 | |||
| | | | ||
| NGI | | NGI | ||
| Open GGUS ticket in [http://ggus.eu/ GGUS system] | | Open GGUS ticket in [http://ggus.eu/ GGUS system] for Operations to add your Nagios instance to <span style="vertical-align: middle;">[https://goc.egi.eu/portal/index.php?Page_Type=Service_Group&id=1206 EGI_CoreSAM] service group | |
</span> | |||
|- valign="top" | |- valign="top" | ||
| rowspan="3" | | | rowspan="3" | 4<br> <br> <br> | ||
| 1 | | 1 | ||
| NGI | | NGI | ||
| Check if your Nagios instance pass all tests on [https://opsmon.egi.eu/nagios/ | | Check if your Nagios instance pass all tests on [https://opsmon.egi.eu/nagios/ Ops-Monitor instance] for at least 7 days. | ||
|- valign="top" | |- valign="top" | ||
| 2 | | 2 | ||
Line 55: | Line 67: | ||
| Should check status of site's services to find errors that cannot be justified with actual site error for at least 7 days. | | Should check status of site's services to find errors that cannot be justified with actual site error for at least 7 days. | ||
|- valign="top" | |- valign="top" | ||
| | | 5 | ||
| | | <br> | ||
| NGI | | NGI | ||
| Open GGUS ticket in [http://ggus.eu/ GGUS system] for Operations Portal SU to enable receiving alarms from your NAGIOS instance | | Open GGUS ticket in [http://ggus.eu/ GGUS system] for Operations Portal SU to enable receiving alarms from your NAGIOS instance | ||
|} | |} | ||
= Revision history = | |||
{| class="wikitable" | |||
|- | |||
! Version | |||
! Authors | |||
! Date | |||
! Comments | |||
|- | |||
| | |||
| Alessandro Paolini | |||
| 2016-06-07 | |||
| "EGI Operations Support" was decommissioned, changed all the references to "Operations" | |||
|- | |||
| | |||
| M. Krakowian | |||
| 19 August 2014 | |||
| Change contact group -> Operations support | |||
|} | |||
<br> | |||
<br> | |||
[[Category:Operations_Procedures]] | [[Category:Operations_Procedures]] |
Latest revision as of 17:32, 30 July 2021
This article is Deprecated and should no longer be used, but is still available for reasons of reference. |
Title | Validation of Operations Centre Nagios |
Document link | https://wiki.egi.eu/wiki/PROC05 |
Last modified | 19 August 2014 |
Policy Group Acronym | OMB |
Policy Group Name | Operations Management Board |
Contact Group | operations@egi.eu |
Document Status | Approved |
Approved Date | 10 August 2010 |
Procedure Statement | The purpose of this document is to define the validation procedure for an Operations Centre Nagios instance |
Owner | Alessandro Paolini |
Overview
This document describes how to register and validate an Operations Centre Nagios instance.
Definitions
Please refer to the EGI Glossary for the definitions of the terms used in this procedure.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
Registration and Validation Steps
Step | Substep | Responsible | Action |
---|---|---|---|
1 | | NGI | Add the instance to GOC DB and to the service group NGI_XX_SERVICES. |
2 | | NGI | Open a GGUS ticket for the SAM/Nagios SU to add your Nagios instance to http://mon.egi.eu/nagios-roles.conf |
3 | | NGI | Open a GGUS ticket for Operations to add your Nagios instance to the EGI_CoreSAM service group. |
4 | 1 | NGI | Check that your Nagios instance passes all tests on the Ops-Monitor instance for at least 7 days. |
4 | 2 | NGI | Check all internal tests (assigned to the Nagios server), which must be OK for at least 7 days. |
4 | 3 | NGI ROD team | Should check the status of the sites' services for at least 7 days to find errors that cannot be explained by an actual site error. |
5 | | NGI | Open a GGUS ticket for the Operations Portal SU to enable receiving alarms from your Nagios instance. |
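The 7-day condition in step 4 can be illustrated with a short sketch. The procedure does not say how the check is carried out, and in practice it is done by inspecting the Nagios pages. The function name `passed_validation_window` and the dictionary of per-day summary states are hypothetical, introduced only for illustration. The sketch assumes each day has been summarised by hand into a single Nagios state string such as "OK" or "CRITICAL".

```python
from datetime import date, timedelta

# Step 4 requires tests to be OK "for at least 7 days".
REQUIRED_DAYS = 7


def passed_validation_window(daily_status, end_day, required_days=REQUIRED_DAYS):
    """Return True if each of the last `required_days` days, ending on
    `end_day` (inclusive), is recorded as "OK".

    `daily_status` is a hypothetical mapping of date -> summary state.
    A day with no record is treated as a failure.
    """
    for offset in range(required_days):
        day = end_day - timedelta(days=offset)
        if daily_status.get(day) != "OK":
            return False
    return True


if __name__ == "__main__":
    # Seven consecutive OK days, 1 to 7 August 2014 (made-up sample data).
    history = {date(2014, 8, 1) + timedelta(days=i): "OK" for i in range(7)}
    print(passed_validation_window(history, date(2014, 8, 7)))  # True

    # A single bad day inside the window fails the check.
    history[date(2014, 8, 4)] = "CRITICAL"
    print(passed_validation_window(history, date(2014, 8, 7)))  # False
```

Treating a missing day as a failure is a deliberately conservative choice here, since a gap in the records cannot show that the tests were passing.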
Revision history
Version | Authors | Date | Comments |
---|---|---|---|
 | Alessandro Paolini | 2016-06-07 | "EGI Operations Support" was decommissioned, changed all the references to "Operations" |
 | M. Krakowian | 2014-08-19 | Change contact group -> Operations support |