OPS-MONITOR profile SAM tests
Latest revision as of 16:02, 10 October 2019
IMPORTANT: Description of metrics is not maintained on this page anymore. Please use POEM directly to find relevant information:
- Services: https://poem.egi.eu/poem/admin/poem/public_service/
- Profiles: https://poem.egi.eu/poem/admin/poem/public_profile/
- Metrics: https://poem.egi.eu/poem/admin/poem/public_metric/
- Probes: https://poem.egi.eu/poem/admin/poem/public_probe/
This article is deprecated and should no longer be used; it remains available for reference.
This table lists the tests used for monitoring operational tools. The tests are executed on the central ARGO instances. Alarms for these tests are opened directly in the Operations Portal Dashboard. The POEM profile used on these instances is OPS-MONITOR (http://poem.egi.eu/poem/admin/poem/profile/8/).
Nagios test | Frequency | Description |
---|---|---|
eu.egi.cloud.Perun-Check | 5min | Probe connects to Perun and checks its status using the internal Perun interface. |
org.activemq.Network-Topic | 5min | Test sends a message on topic global.monitor.test.topicnetwork.$HOSTNAME$ and checks if it can be received on all brokers. |
org.activemq.Network-VirtualDestination | 5min | Test sends a message on virtual destination global.monitor.test.virtualdestinations.$HOSTNAME$ and checks if it can be received on all brokers. |
org.activemq.OpenWire | 5min | Test checks whether it is possible to send and receive a message on the queue /queue/monitor.test.$_SERVICESERVER$.$HOSTNAME$.openwire using the OpenWire protocol. |
org.activemq.OpenWireSSL | 5min | Test checks whether it is possible to send and receive a message on the queue /queue/monitor.test.$_SERVICESERVER$.$HOSTNAME$.openwiressl using the OpenWire over SSL protocol. |
org.activemq.STOMP | 5min | Test checks whether it is possible to send and receive a message on the queue /queue/monitor.test.$_SERVICESERVER$.$HOSTNAME$.stomp using the STOMP protocol. |
org.activemq.STOMPSSL | 5min | Test checks whether it is possible to send and receive a message on the queue /queue/monitor.test.$_SERVICESERVER$.$HOSTNAME$.stompssl using the STOMP over SSL protocol. |
org.nagios.ARGOWeb-AR | 5min | Checks the availability of the ARGO web portal (A/R interface). It checks availability of web page /lavoisier/ngi_reports?accept=csv and checks for pattern NGI_HR. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagios.ARGOWeb-Status | 5min | Checks the availability of the ARGO web portal (Metric Status interface). It checks availability of web page /lavoisier/status_report-site?accept=csv and checks for pattern IN2P3-CC. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagios.GOCDB-PortCheck | 5min | Checks if goc.egi.eu TCP port 443 is open. Additional documentation: http://nagiosplugins.org/man/check_tcp. (TODO: remove this check, GOCDB-WebCheck is sufficient). |
org.nagiosexchange.Broker-BDII | 6h | Checks if broker service endpoint (GlueServiceEndpoint=*$HOSTNAME$*) is published in the top BDII. |
org.nagiosexchange.AccountingPortal-WebCheck | 5min | Checks if web site accounting.egi.eu/test.php is available. Additional documentation: http://nagiosplugins.org/man/check_http. (TODO: add extra checks of output from the web site). |
org.nagiosexchange.AppDB-WebCheck | 5min | Checks if web site appdb.egi.eu is available. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.CRM-WebCheck | 5min | Checks if web site vt.lip.pt is available. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.GGUS-WebCheck | 5min | Checks if web site ggus.eu/nagios/service.php is available. The PHP page performs internal GGUS checks and reports back the status of the whole GGUS system. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.GOCDB-PI | 5min | Checks the availability of the GOCDB PI. It checks availability of web page goc.egi.eu/gocdbpi/public/?method=get_site_list&sitename=IN2P3-CC. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.GOCDB-WebCheck | 5min | Checks if web site goc.egi.eu/portal/GOCDB_monitor/ops_monitor_check.php is available. The PHP page performs internal GOCDB checks and reports back the status of the whole GOCDB system. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.GSTAT-WebCheck | 5min | Checks if web site gstat.egi.eu is available. It uses web page /gstat/summary/json/. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.MetricsPortal-WebCheck | 5min | Checks if web site metrics.egi.eu is available. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.MyEGIWebInterface | 5min | Checks if web site /myegi on a given host is available. This check is used for all SAM instances (central, NGI, VO). Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.NagiosHostSummary | 5min | Test checks host summary status by using Nagios web interface. |
org.nagiosexchange.NagiosProcess | 5min | Test checks status of Nagios process by using Nagios web interface. |
org.nagiosexchange.NagiosServiceSummary | 5min | Test checks service summary status by using Nagios web interface. |
org.nagiosexchange.NagiosWebInterface | 5min | Test checks availability of SAM Nagios interface. |
org.nagiosexchange.OpsPortal-WebCheck | 5min | Checks if web site operations-portal.egi.eu is available. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.Portal-WebCheck | 5min | Checks if web site (egi.Portal service type) is available. Additional documentation: http://nagiosplugins.org/man/check_http. |
org.nagiosexchange.TMP-WebCheck | 5min | Checks if web site egitraining.esc.rl.ac.uk is available. Additional documentation: http://nagiosplugins.org/man/check_http. |
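
Several of the web checks above fetch a page and look for a pattern in the response (for example, the ARGOWeb-AR check looks for NGI_HR). The following Python sketch illustrates that idea only; it is not the check_http plugin and does not reproduce its options. The function names, the example host, and the rule that any status of 400 or above is reported as CRITICAL are assumptions made for illustration. The exit codes follow the usual Nagios plugin convention (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
import urllib.error
import urllib.request

# Conventional Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def evaluate_response(status, body, pattern=None):
    """Map an HTTP status and body to a Nagios-style (exit_code, message).

    Simplification (assumption): every status >= 400 is CRITICAL.
    """
    if status >= 400:
        return CRITICAL, f"HTTP CRITICAL - status {status}"
    if pattern is not None and pattern not in body:
        return CRITICAL, f"HTTP CRITICAL - pattern '{pattern}' not found"
    return OK, f"HTTP OK - status {status}"


def check_url(url, pattern=None, timeout=10.0):
    """Fetch url and evaluate it; network failures are reported as CRITICAL."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            body = response.read().decode("utf-8", errors="replace")
            return evaluate_response(response.status, body, pattern)
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status (4xx or 5xx).
        return evaluate_response(err.code, "", pattern)
    except (urllib.error.URLError, OSError) as err:
        # DNS failure, refused connection, timeout, and similar.
        return CRITICAL, f"HTTP CRITICAL - {err}"
```

A call such as `check_url("https://argo.example.org/lavoisier/ngi_reports?accept=csv", pattern="NGI_HR")` would mirror the ARGOWeb-AR description; the host name here is a placeholder, since the table gives only the path and the pattern.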