Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Unknown issue"

From EGIWiki
Jump to navigation Jump to search
Line 13: Line 13:
<br>  
<br>  


= Problems =
= Problems &amp; Questions<br> =


#No policy for test developers when test should return UNKNOWN status. What does UNKNOWN status mean?<br>  
#No policy for test developers when test should return UNKNOWN status. What does UNKNOWN status mean?<br>  
#Some of NGIs reach ~0% for all their sites and some reach even ~40%, sometimes disproporsions are even within one NGI. What/where is the reason for so high values and disproporsions?
#Some of NGIs reach ~0% for all their sites and some reach even ~40%, sometimes disproporsions are even within one NGI. What/where is the reason for so high values and disproporsions?
== What can cause UNKNOWN status? ==
tbd
== When test can return UNKNOWN&nbsp;status? ==
tbd
== What can cause UNKNOWN&nbsp;status disproporsions between sites within one NGI? ==
tbd


= Solution proposals  =
= Solution proposals  =

Revision as of 12:13, 12 October 2011

this page will contain information about UNKNOWN status issue

Present situation

Availability and Reliability calculations formulas:

Availability = Uptime / (Total time - Time_status_was_UNKNOWN)
Reliability = Uptime / (Total time - Scheduled Downtime - Time_status_was_UNKNOWN)


How to read in context of UNKNOWN status:

  1. Period in which site is in status UNKNOWN is not taken into calculation.
  2. During this period EGI doesn’t know what is happening with the infrastructure.


Problems & Questions

  1. No policy for test developers when test should return UNKNOWN status. What does UNKNOWN status mean?
  2. Some of NGIs reach ~0% for all their sites and some reach even ~40%, sometimes disproporsions are even within one NGI. What/where is the reason for so high values and disproporsions?

What can cause UNKNOWN status?

tbd

When test can return UNKNOWN status?

tbd

What can cause UNKNOWN status disproporsions between sites within one NGI?

tbd

Solution proposals

Strict policy for the developers how to use UNKNOWN status

Advantage: we will be sure that all problems will be properly addressed as ERROR not UNKNOWN
Disadvantages: someone has to write the policy and check whether it is respected

Alarms for UNKNOWN status should be created when UNKNOWN status is longer than 4h

Advantage: we will be notified if the UNKNOWN status takes too long
Disadvantages: it means an extra work for ROD which will be look not only after ERRORs but also UNKNOWNs

Threshold for UNKNOWN status

Advantage: it is easy and fast to implement and automate
Disadvantages: there is a possibility that overlook an important problem