Difference between revisions of "ARGO"

From EGIWiki
Line 74:

=== Profiles for RC monitoring ===
*[https://poem.egi.eu/poem/admin/poem/profile/2/ ARGO_MON]  
*[https://poem.egi.eu/poem/admin/poem/profile/1/ ARGO_MON]  
** Tests for monitoring of all EGI services.
** [[ROC_SAM_Tests |ROC Tests description]]

Revision as of 13:53, 25 April 2018


Tool name: ARGO
Tool category and description: Service Monitoring for Availability and Reliability
Tool URL: https://argoeu.github.io
Email: argo-ggus-support@grnet.gr
GGUS Support unit: ARGO/SAM EGI Support
GOC DB entry: https://goc.egi.eu/portal/index.php?Page_Type=Site&id=641
Requirements tracking (EGI tracker): https://rt.egi.eu/rt/Dashboards/5544/SAM-Requirements
Issue tracking (Developers tracker): https://github.com/ARGOeu/ARGO/issues
Release schedule: https://github.com/ARGOeu/ARGO/milestones
Release notes: TBD
Roadmap: TBD
Related OLA: https://documents.egi.eu/public/ShowDocument?docid=2170
Test instance URL: http://cclavoisier04.in2p3.fr:8080/lavoisier
Documentation: https://argoeu.github.io/overview/
License: Apache 2
Source code: https://github.com/ARGOeu/

Change, Release and Deployment

These sections set out the detailed agreements on requirements gathering, release, and deployment of the tool, which extend the Instructions for Operations Tools teams.


ARGO monitoring engine

ARGO monitoring engine consists of the following central instances:

POEM Profiles

Profiles for RC monitoring

Profile for Cloud RC monitoring

Profiles for Operations Tools monitoring


ARGO tests

Tests on ARGO MON instances are the ones that the framework includes in the ARGO configuration.

List of tests:

Operations tests

Tests on the Operations Portal are the ones used for raising alarms for ROD and Operations teams. The Operations Portal does not execute these tests itself; it receives alarms from NGI/ROC SAM instances. It keeps a list of the probes used for alarms, and alarms from any other probe are filtered out.
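The filtering described above can be pictured with a minimal Python sketch. It is an illustration only, not the Operations Portal's actual code: the `Alarm` record, the `filter_alarms` helper, and the probe and site names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass(frozen=True)
class Alarm:
    """Hypothetical record for an alarm received from an NGI/ROC SAM instance."""
    site: str
    probe: str
    status: str


def filter_alarms(alarms: Iterable[Alarm], alarm_probes: Set[str]) -> List[Alarm]:
    """Keep only alarms whose probe is on the configured alarm list."""
    return [alarm for alarm in alarms if alarm.probe in alarm_probes]


# Hypothetical probe list; the real list is maintained in the Operations Portal.
ALARM_PROBES = {"example.probe-A", "example.probe-B"}

incoming = [
    Alarm("SITE-1", "example.probe-A", "CRITICAL"),
    Alarm("SITE-1", "example.probe-X", "CRITICAL"),  # not on the list: filtered out
    Alarm("SITE-2", "example.probe-B", "WARNING"),
]

kept = filter_alarms(incoming, ALARM_PROBES)
for alarm in kept:
    print(alarm.site, alarm.probe, alarm.status)
```

Filtering on probe name alone mirrors the text: the portal does not run or re-evaluate the tests, it only decides which incoming alarms are shown.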

The procedure for adding a new probe is described in PROC06.

The list of tests is on the Operations SAM tests page.

Availability tests

This is the set of tests used for calculating the availability and reliability (A/R) of sites and services. The A/R calculation is tied to the OLA. As with the Operations Portal, the availability calculation component receives results from the NGI/ROC SAM instances.
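As a rough illustration of an A/R calculation, the Python sketch below uses a commonly cited pair of definitions. This page does not state them, so they are an assumption here: availability is uptime divided by the time for which the status is known, and reliability additionally excludes scheduled downtime from the denominator. The function name and the sample figures are hypothetical, and the authoritative definitions are those in the OLA.

```python
from typing import Optional, Tuple


def availability_reliability(
    total_hours: float,
    up_hours: float,
    scheduled_downtime_hours: float,
    unknown_hours: float,
) -> Tuple[Optional[float], Optional[float]]:
    """Return (availability, reliability) as fractions, or None when undefined.

    Assumed definitions (not taken from this page):
      availability = up / (total - unknown)
      reliability  = up / (total - unknown - scheduled downtime)
    """
    known = total_hours - unknown_hours
    availability = up_hours / known if known > 0 else None

    unscheduled_base = known - scheduled_downtime_hours
    reliability = up_hours / unscheduled_base if unscheduled_base > 0 else None
    return availability, reliability


# Hypothetical 30-day month: 720 h in total, 684 h up, 24 h scheduled downtime.
availability, reliability = availability_reliability(
    total_hours=720, up_hours=684, scheduled_downtime_hours=24, unknown_hours=0
)
print(f"availability={availability:.4f} reliability={reliability:.4f}")
```

Under these assumptions reliability is never lower than availability, because scheduled downtime is removed from its denominator only.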

TSA1.8 proposes changes to the availability calculation (that is, which probe results count toward it), and the OMB approves them.

The list of tests is on the Availability SAM tests page.