
Difference between revisions of "Agenda-14-03-2016"



== Monthly Availability/Reliability ==
* The report for the last three months is available on [http://argo.egi.eu/lavoisier/ngi_reports?month=2016-01 ARGO]
* Problems follow-up:
** AfricaArabia: [https://ggus.eu/?mode=ticket_info&ticket_id=117094 ticket]
*** Overall A/R: 12.67/12.67
*** RCs eligible for suspension: EG-ZC-T3, ZA-CHPC, ZA-UJ
** CERN: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=118843 ticket]
*** Overall A/R: 33.22/33.22
*** there were problems on the regional SAM instances, solved in January
** NGI_ARMGRID: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=119415 ticket]
*** Overall A/R: 77.43/77.43
** NGI_DE: [https://ggus.eu/?mode=ticket_info&ticket_id=117099 ticket]
*** the underperforming RCs (SCAI, UNI-DORTMUND) are recovering from the issues
** NGI_GRNET: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=119414 ticket]
*** RC eligible for suspension: GR-04-FORTH-ICS
** NGI_IT: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=118846 ticket]
*** the underperforming RC INFN-NAPOLI-PAMELA appears to be recovering; waiting for confirmation
** NGI_MARGI: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=118465 ticket]
*** no monitoring data available since January
*** RC eligible for suspension: MK-03-FINKI
** NGI_MD:
*** Overall A/R: 61.89/61.89
*** the underperforming RC MD-02-IMI is recovering
** ROC_LA: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=119416 ticket]
*** no monitoring data available for CBPF
*** RC eligible for suspension: UFAL


== Next meeting ==


* '''14 Mar 2016''' https://indico.egi.eu/indico/conferenceDisplay.py?confId=2736

Revision as of 14:46, 7 March 2016


General information

News from URT

Staged rollout updates

Next releases

Operational issues

Aligning Fedcloud sites to the A/R procedures

  • EGI Operations proposes to align Fedcloud sites with the A/R-related procedures used for grid sites
    • based on the availability/reliability of the services monitored in cloudmon, EGI Operations will start following up with underperforming sites, as is already done for grid sites
    • sites will NOT be suspended for A/R performance until at least the end of May
  • in parallel, EGI Operations will start PROC08 to include cloud probes in the EGI_CRITICAL and EGI profiles used for A/R computations (IN PROGRESS)

The proposed timeline is:

  • February 2016:
    • EGI Operations will check the status of the production cloud services to understand which issues (if any) each site has, and will provide help to NGIs and sites;
    • Start of the integration of cloud probes into the EGI_CRITICAL profile (current set + OpenStack): to be agreed with the ARGO team; PROC08 will be followed
  • June 2016:
    • Start notifying sites that are eligible for suspension

FedCloud status

Issues at cloud sites

Grouped by NGI; please follow up with sites.

Getting help on issues

Updating Federated_Cloud_Operation wiki

Decommissioning Debian

Decommissioning SL5

  • Tracked on SL5_retirement wiki
  • No checks for dCache, DPM, ARC, UNICORE --> Action on NGIs/ROCs to follow up directly with sites

Decommissioning dCache 2.6

  • DONE.

AOB

Monthly Availability/Reliability

Next meeting