Difference between revisions of "SAM"
Revision as of 14:30, 21 October 2011
The Service Availability Monitoring (SAM) system monitors the resources within the production infrastructure. SAM monitoring data is used to calculate the availability and reliability of grid sites. It includes the following components:
- probes: a test execution framework (based on the open source monitoring framework Nagios) and the Nagios Configuration Generator (NCG)
- the Aggregated Topology Provider (ATP), the Metrics Description Database (MDDB), and the Metrics Results Database (MRDB)
- the message bus to publish results and a programmatic interface
- the visualization portal (MyEGI).
SAM tool instances
Tests and probes
- Terminology
- SAM released probes
- EMI Nagios probes (old page instance)
- Probes from org.SAM package
Profiles
For OPS VO:
- ARC
- GLEXEC
- NGI
- ROC
- ROC_CRITICAL
- ROC_OPERATORS
- WLCG_CREAM_CRITICAL
- WLCG_CREAM_LCGCE_CRITICAL (profile used for EGI Availability/Reliability computation - OPS VO)
- WLCG_CRITICAL
- WLCG_CRITICAL_TEST
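The metrics contained in a profile can be looked up through the SAM programmatic interface (SAM-PI). The sketch below builds the lookup URL for a given profile, following the `metrics_in_profiles` link pattern used for the OPS VO profiles on this page. The function name `profile_metrics_url` is hypothetical, and it is an assumption that every profile listed above is served under the same pattern; the sketch only builds the URL and does not query the service.

```python
from urllib.parse import urlencode

# Base URL taken from the OPS VO profile links on this page.
SAM_PI_BASE = "http://grid-monitoring.cern.ch/myegi/sam-pi/metrics_in_profiles"


def profile_metrics_url(profile_name, vo_name="ops"):
    """Return the SAM-PI URL listing the metrics in a profile.

    Hypothetical helper: assumes the endpoint accepts the query
    parameters vo_name and profile_name, as in the page links.
    """
    query = urlencode({"vo_name": vo_name, "profile_name": profile_name})
    return f"{SAM_PI_BASE}?{query}"


if __name__ == "__main__":
    # Profile used for EGI Availability/Reliability computation (OPS VO).
    print(profile_metrics_url("WLCG_CREAM_LCGCE_CRITICAL"))
```

Passing a different `vo_name` would target another VO, provided the SAM instance publishes profiles for it.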
Others:
Documentation
Release Notes
SAM Project Tracking
Installation instructions
- SAM documentation home (new Confluence page)
- Installation instructions (new Confluence page)
- Setting up a VO SAM instance
- Nagios and NCG YAIM-based installation instructions (old page with YAIM variable definitions)
- SAM/Nagios reference card for site managers
- SAM Administrators FAQ
Monitoring uncertified sites
Tools information pages
MyEGI
NCG
Databases
- Aggregated Topology Provider (ATP)
- Profile Management Database (POEM)
- Metric Result Store (MRS)
SAM Milestones
Related Procedures
- Validate ROC or NGI Nagios Procedures: PROC05
- Setting a Nagios test status to OPERATIONS: PROC06
- Adding new probes to SAM: PROC07
- Management of the EGI OPS Availability and Reliability Profile: PROC08
SAM/Nagios EGI Support Procedures
External Links
- SAM Project home page
- Multi Level Monitoring Overview
- Computation of Service Availability Metrics in ACE
- SAM-PI documentation (unofficial wiki page containing SAM-PI examples)