Underperforming sites and suspensions

From EGIWiki
Revision as of 15:08, 17 October 2014 by Caifti (talk | contribs)
Jump to: navigation, search
Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security




Information about underperforming and suspended Resource Centres is provided in the table below.

date GGUSID nb. sites

below 80%/85% targets

(before May 2014 75%/70%) 

nb. sites

below the target for 3 months

  • site name, NGI/ROC, short reason to not suspend (if applicable), GGUS ID
nb. sites

which didn't provide explanation

  • site name, NGI/ROC, short reason to not suspend (if applicable), GGUS ID
nb. sites

suspended by Operation/NGI

  • site name, NGI/ROC, short reason, GGUS ID
nb. sites

suspended by CSIRT

  • site name, NGI/ROC, short reason, GGUS ID
Sept 2014 109027 74

X

x x
0
June 2014 106738 66

16

  • KR-KISTI-GCRT-01, ROC_Asia/Pacific - 107135 - sit. improved starting with 18.07
  • MY-UTM-GRID, ROC_Asia/Pacific - 107136 - 0% ava/rel since 13.07, site unresponsive
  • PK-CIIT, ROC_Asia/Pacific - 107137 - 0% ava/rel in the last 30 days, site complains hardware pb, upgrads of services to UMD3
  • BG01-IPP, NGI_BG - 107130 - many days with 0% ava/rel and no downtimes
  • BG08-MADARA, BG01-IPP - 107105 - NGI recommends suspension, but sit. improved a lot during last 30days
  • CY-01-KIMON, NGI_CYGRID - [1] - problems using proxy server myproxy.hellasgrid.gr, sit. improved (100% ava/rel) in the last 5 days
  • SCAI, NGI_DE - 107120 - site had to update CAs, but the sit. didn't improve
  • BRGM-ORLEANS, NGI_FRANCE - 107108 - the site is being decommissioned
  • CFP-IST, NGI_IBERGRID - 107109 - problems with their physical infrastructure at the university, which took long time to solve
  • IEETA, NGI_IBERGRID - 107123 - Handover & overload of new responsible. Since the mid of July, the site is performing well again
  • INFN-CAGLIARI, INFN_IT - 107124 - several failures in the past months, sit. unstable in the last month
  • INFN-LECCE, INFN_IT - 107125 - reinstalltion of services on new hardware, sit. improved in the last month
  • RO-09-UTCN, NGI_RO - 107113 - still 0% ava/rel, no improvement, unresponsive site
  • RO-11-NIPNE, NGI_RO - 107114 - problem with old CAs, sit. improved in the last month
  • RO-15-NIPNE, NGI_RO - 107115 - unresponsive site, sit. improved in the last month
  • ATLAND, ROC_LA - 107116 - network problems, sit. still unstable
x 4
  • INFN-CAGLIARI, INFN_IT - 107124 - several failures in the past months, sit. unstable also in July
  • MY-UTM-GRID, ROC_Asia/Pacific - 107135 - low ava/rel (0%) continued also in the following months
  • RO-09-UTCN, NGI_RO - 107113 - 0% ava/rel, sites has problems with its CREAM CE
  • PK-CIIT, ROC_Asia/Pacific - 107137 - 0% ava/rel
0
April 2014 [2] 26

1

0 0 0
March 2014 101947 23 4 1 TH-HAII 1 TH-HAII 0
February 2014 101091 5 5 0 0 0
January 2014 99508 35 5
December 2013 100118 3 3 1 1
  • ru-PNPI (ROC RUSSIA), site unresponsive, 3 consecutive months below target (av/rel), suspended by COD, 99709
November 2013 98976 30 3 0 2
October 2013 97923 2 2

2

2
September 2013 97162 51 7
  • MY-MIMOS-GC-01 97189 no reply from site, suspended by ROD
  • VN-IFI 97186 network problems, suspended by ROD
  • GRISU-COMETA-IAF-CT 97185 suspended by ROD
  • SAMPA 97187 SAM nagios issues
  • KR-UOS-SSCC 97188 site OK now
  • INDIACMS-TIFR 97190 site OK now
  • UA_BITP_ARC 97191 bug in arc info system. site OK now.
0 3
  • MY-MIMOS-GC-01 97189 no reply from site, suspended by ROD
  • VN-IFI 97186 network problems, suspended by ROD
  • GRISU-COMETA-IAF-CT 97185 suspended by ROD

July 2013 96362 34 5
  • MY-MIMOS-GC-01 96498 no reply from site, suspended by ROD
  • ROC_LA/CBPF 96499 problem with publishing data from nagios. no suspension, to be monitored
  • ROC_LA/EELA-UTFSM 96500 problem with publishing data from nagios. no suspension, to be monitored
  • ROC_LA/ICN-UNAM 96501 problem with publishing data from nagios. no suspension, to be monitored
  • ROC_LA/SAMPA 96502 problem with publishing data from nagios. no suspension, to be monitored
1


  • MY-MIMOS-GC-01 96498 no reply from site, suspended by ROD



June 2013 95609 51 5
  • NGI_BG/BG01-IPP 95834 persistent file system issues. Hope to fix this soon. No suspension.
  • NGI_IT/GRISU-COMETA-INFN-CT 95835
  • NGI_IT/INFN-ROMA1-VIRGO 95836 Problem with the statistics. No suspension.
  • ROC_LA/ICN-UNAM 95837 Regional nagios issues. Installed a new one. No suspension.
  • ROC_LA/SAMPA 95838 Regional nagios issues. Installed a new one. No suspension.
N/A 1
  • NGI_IT/GRISU-COMETA-INFN-CT 95835 Suspended by ROD
May 2013 94557 27

1

  • ROC_LA ICN-UNAM 94656Problem with regional Nagios, not at site. Not supending.
April 2013 93826 21

3

  • ROC_Asia/Pacific/INDIACMS-TIFR 94060
  • NGI_IT/GRISU-COMETA-INFN-LNS 94061
  • ROC_LA/ICN-UNAM 94062
N/A 1
  • NGI_IT/GRISU-COMETA-INFN-LNS 94061 suspended by ROD

March 2013 93262 29

3

  • ROC_Asia/Pacific/NZ-UOA 93468 site unresponsive, availability 0 for 3 months. Suspending.
  • ROC_Asia/Pacific/TH-HAII 93469 updated CE node to EMI2 - performance now improved. Not suspending.
  • NGI_RO/RO-02-NIPNE 93470 hardware problems, software upgrades - performance now improved. Not suspending.
N/A

1

ROC_Asia/Pacific/NZ-UOA, site unresponsive, availability 0 for 3 months 93468


February 2013 92264 31

12

  • ROC_Asia/Pacific/TH-HAII 92405 Problems now solved
  • NGI_ARMGRID/AM-04-YERPHI 92406 site came back after suspension
  • NGI_FRANCE/BRGM-ORLEANS 92407 issues upgrading to UMD
  • NGI_IT/INFN-NAPOLI-PAMELA 92408 site came back after suspension
  • NGI_RO/RO-01-NIPNE 92409 site has been offline. problem solved now.
  • NGI_RO/RO-09-UTCN 92410 issues upgrading to UMD
  • NGI_RO/RO-11-NIPN 92411 installation of new hardware. reinstallation of site
  • NGI_UK/UKI-LT2-UCL-HEP 92412 issues upgrading to UMD
  • ROC_Russia/RU-ISA-CGTDC 92413 site has been decommissioned
  • ROC_Russia/RU-SPbSU 92414 A/R will be recomputed
  • ROC_Russia/Ru-Troitsk-INR-LCG2 92415 A/R will be recomputed
  • ROC_Russia/ru-Moscow-SINP-LCG2 92416 Problem now solved
N/A

0


January 2013 91218 45

9

  • NGI/FRANCE/BRGM-ORLEANS 91411
  • NGI_NL/LSG-AMS 91412
  • NGI_RO/RO-09-UTCN 91413 update to UMD-2 + hardware problems
  • NGI_RO/RO-11-NIPNE 91414 major hardware upgrades.
  • NGI_RO/RO-15-NIPNE 91415
  • NGI_TR/TR-03-METU 91416problems with old DPM. New hardware planned.
  • ROC_LA/SAMPA 91417 update to EMI-2 related problems.
  • ROC_Russia/RU-SPbSu 91418 - retirement of old regional Nagios. New instance in place.
  • ROC_Russia/ru-Troitsk-INR-LCG2 91419 - retirement of old regional Nagios. New instance in place.
N/A

1

  • NGI_NL/LSG-AMS 91412 suspended by ROD

December 2012 90185 54

12

  • ROC_Asia/Pacific/MY-MIMOS-GC-01 90187 - Downtime M/W upgrades
  • NGI_HR/egee.irb.hr 90188 - M/W upgrades,H/W replacements
  • NGI_IT/Hephy-Vienna 90189 - M/W upgrades, cooling problems
  • NGI_NL/LSG-AMS 90190 - Issues with NICs
  • NGI_RO/NIHAM 90191 - Nagios issue, recomputation requested.
  • NGI_RO/RO-15-NIPNE 90192 - M/W upgrades, fibre issues
  • ROC_LA/EELA-UNLP 90193 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 90194 - Suspended by ROD
  • ROC_LA/ATLAND 90195 - Suspended by ROD
  • ROC_LA/CBPF 90196 - Nagios issues
  • ROC_LA/ICN-UNAM 90197 - Nagios issues
  • ROC_LA/SAMPA 90198 - Nagios issues
N/A

3

  • ROC_LA/EELA-UNLP 90193 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 90194 - Suspended by ROD
  • ROC_LA/ATLAND 90195 - Suspended by ROD



November 2012 89223 72

13

  • ROC_Asia/Pacific/KR-UOS-SSCC 89224 - nagios issue, recomputation requested
  • ROC_Asia/Pacific/MY-MIMOS-GC-01 89225 - nagios issue, recomputation requested
  • NGI_BG/BG05-SUGrid 89227 - series of issues, now fixed
  • NGI_BG/BG02-IM 89227 - suspended by ROD
  • NGI_RO/NIHAM 89230 - nagios issue, recomputation requested
  • ROC_LA/EELA-UNLP 89231 - nagios issues
  • ROC_LA/UFRJ-IF 89232 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 89233 - suspended by ROD
  • ROC_LA/ATLAND 89234 - nagios issues
  • ROC_LA/CBPF 89235 - nagios issues
  • ROC_LA/ICN-UNAM 89236 - nagios issues
  • ROC_LA/SAMPA 89237 - nagios issues
  • ROC_LA/SUPERCOMPUTO-UNAM 89238 - Suspended by ROD
N/A

4

  • NGI_BG/BG02-IM 89227 - suspended by ROD
  • ROC_LA/UFRJ-IF 89232 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 89233 - suspended by ROD
  • ROC_LA/SUPERCOMPUTO-UNAM 89238 - Suspended by ROD

September 2012 86871 51

8

  • ROC_Asia/Pacific/PK-CIIT 86803 - Suspended by COD due to lack of improvement.
  • NGI_BG/BG08-MADARA 86804 Suspended by COD due to lack of improvement.
  • NGI_IT/AREA-BO 86805 Not suspended due to good performance this month. NGI claims the problem is solved.
  • NGI_NL/LSG-LUMC 86807 Not suspended due to good performance this month. NGI claims the problem is solved.
  • NGI_IT/GRISU-COMETA-UNICT-DMI 86806 - Suspended by COD due to lack of improvement.
  • NGI_RO/NIHAM 86808 Not suspended due to rise of availability since 22.10.
  • ROC_IGALC/UFCG-LSD 86809 Already decommissioned.
  • ROC_LA/UFRJ-IF 87233 Not suspended due to move to ROC_LA and central Nagios problem - to be checked next month.
0 3
  • ROC_Asia/Pacific/PK-CIIT 86803
  • NGI_BG/BG08-MADARA 86804
  • NGI_IT/GRISU-COMETA-UNICT-DMI 86806

August 2012 85774 44

2

  • BG08-MADARA, NGI_BG, ..., 85843 Infrastructural problems who are close to being fixed
  • BY-BNTU, NGI_BY, ..., 85844 site suspended by ROD
0 0
July 2012 84771 30

4

  • INDIACMS-TIFR, AsiaPacific, suspended by ROC, 84860
  • CFP-IST, NGI_IBERGRID, not suspended: hardware problems resolved, site performed above the threshold during last month, 84861
  • ICN-UNAM, ROC_LA, not suspended: they probably overcome problems, if situation occures again, suspend in September 84863
  • TR-05-BOUN, NGI_TR, suspended by ROC, 84862
0 0
June 2012 83902 40 1
  • MY-UUM-SINTOK, AsiaPacific, low availability for three months, suspended by NGI 84075

3

  • NGI_BG/BG08-MADARA, 84044
  • NGI_BG/BG06-GPHI, 84043
  • ROC_Russia/RU-SPbSU, 84074
0
May 2012 82784 33 0
0 0

April 2012 81843 34 3
  • CEFET-RJ, ROC_IGALC, low availability for three months, suspended by ROC, 81974
  • ID-ITB, AsiaPacific, low availability for three months suspended by ROC, 81972
  • PH-ASTI-LIKNAYAN AsiaPacific, low availability for three months, site was suspended but set to certified in order to recertify it, 81973
0 0
March 2012 80900 25

2

  • BY-BNTU, NGI_BY, not suspended because it achieves 86% in April, 81086
  • AM-05-YSU, NGI_ARMENIA, site suspended by ROC, 81087
0


February 2012 79844 35 3 0 2 suspended by ROC
  • IN-DAE-VECC-02, 80037
  • MY-UM-CRYSTAL, 80038
0
January 2012 79020 30 3 0
0
1; GARR-01-DIR, NGI_IT due to issue (EGI-20120116-01) recorded in RTIR ticket #3300)
December 2011 78040 28 3 0
0
0
November 2011 77170 26 3 0
0
0
October 2011 76305 45 3

3

  • ROC_Russia/RRC-KI, 76408
  • NGI_IL/WEIZMANN-LCG2, 76386
  • NGI_IL/TECHNION-HEP, 76384

1

  • NGI_ARMGRID/AM-04-YERPHI,76428
0
September 2011 74965 25 6 0

2

  • ROC_Asia/Pacific/TW-NCUHEP, 3 consecutive months below the target (site suspended by ROC), #75041
  • ROC_Asia/Pacific/IN-DAE-VECC-02, 4 consecutive months below the target (site suspended by COD), #75040
0
August 2011 74147 31 3 0 0 0
July 2011 73193 35 4 0

2

  • ROC_Asia/Pacific/TW-eScience, 3 consecutive months below the target(site suspended by ROC), ticket #73539
  • ROC_Asia/Pacific/TW-NYMU-GRID,3 consecutive months below the target(site suspended by ROC), ticket #73540
0
June 2011 72259 31 8 0 5
  • ROC_Asia, Pacific/PH-ASTI-LIKNAYAN, 3 consecutive months below target, ticket #72435
  • ROC_Asia, Pacific/TH-HAI, 3 consecutive months below target, #72436
  • NGI_IBERGRID, UOGRID, 3 consecutive months below target, #72439
  • ROC_IGALC, CEFET-RJ, 3 consecutive months below target, #72440
  • ROC_IGALC, LA-MERIDA, 3 consecutive months below target, #72441
0
May 2011 71643 23

4

this month first time targed was changed to ava 70% rel 75%

0 2
  • UOGRID, NGI_IBERGRID, below 50%/50% for three months, 71787
  • PH-ASTI-LIKNAYAN, ROC_Asia/Pacific, below 50%/50% for three months(site suspended by ROC), 71789 
0
April 2011 70289 39 0 0 0 0
March 2011 69629 30 0

1

  • ru-Moscow-SINP-LCG2, ROC_Russia, 69765
0 0
February 2011 68299 / See also 68229 28 0 1
  • RU-SPbSU, ROC_Russia

0
Jan 2011 67008 27 2 1
  • ru-IMPB-LCG2,ROC_Russia, 67038
2
  • MY-UM-PANG5, ROC_Asia/Pacific, below 50%/50% for three months(site suspended by ROC), 67010
  • ru-IMPB-LCG2,ROC_Russia, sites didn't provided explanation,67038
0
Dec 2010 65971 30 0 0 0 0
Nov 2010 64892 40 1 0 1
  • ID-ITB, ROC_Asia/Pacific,below 50%/50% for three months (site suspended by ROC),64951
0
Oct 2010 63658 37 4 1
  • AM-04-YERPHI, NGI_ARMGRID, 63854
2
  • AM-01-IIAP-NAS-RA, NGI_ARMGRID, below 50%/50% for three months, 63837
  • AM-04-YERPHI, NGI_ARMGRID, no explanation given, 63854
0
Sep 2010 62853 39 2 0 0 0
Aug 2010 61797 43 3 10
  • JP-HIROSHIMA-WLCG, ROC AP, 62323
  • AU-PPS, ROC AP,62316
  • MY-UTM-GRID, ROC AP, 62312
  • TH-NECTEC-LSR, ROC AP, 62304
  • MY-UM-CRYSTAL, ROC AP, 62300
  • ID-ITB, ROC AP, 62296
  • GRISU-COMETA-INFN-LNS, ROC Italy, 62314
  • UNIGE-DPNC, NGI_NDGF, 62308
  • ru-IMPB-LCG2, ROC_Russia, 62305
  • IL-TAU-HEP,ROC_SE, 62306
1
  • TW-NCUHEP, ROC AP, below 50%/50% for three months, 62340
0
Jul 2010 61115 47 4 0 0 0
Jun 2010 60216 59 4 0 0 0
May 2010 59736 38 3 1
  • ru-Chernogolovka-IPCP-LCG2, ROC RU, 59819
0 0