Underperforming sites and suspensions

From EGIWiki
Jump to: navigation, search
Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security



Alert.png This article is Deprecated and should no longer be used, but is still available for reasons of reference.


Information about underperforming and suspended Resource Centres is provided in the table below.

date GGUSID nb. sites

below 80%/85% targets

(before May 2014 75%/70%) 

nb. sites

below the target for 3 months

  • site name, NGI/ROC, short reason to not suspend (if applicable), GGUS ID
nb. sites

which didn't provide explanation

  • site name, NGI/ROC, short reason to not suspend (if applicable), GGUS ID
nb. sites

suspended by Operation/NGI

  • site name, NGI/ROC, short reason, GGUS ID
nb. sites

suspended by CSIRT

  • site name, NGI/ROC, short reason, GGUS ID
Oct 2014 10090

21


  • CBPF
  • ICN-UNAM
  • SAMPA
  • UNI-DORTMUND
  • IN-DAE-VECC-02
  • INDIACMS-TIFR
  • KR-UOS-SSCC
  • MY-USM-GCL
  • TW-FTT
  • TW-NTU-HEP
  • TW-eScience
  • Taiwan-LCG2
  • DZ-01-ARN
  • EG-ZC-T3
  • MA-01-CNRST
  • ZA-CHPC
  • UA-KNU
  • INFN-NAPOLI-ARGO
  • INFN-NAPOLI-PAMELA
  • INFN-ROMA1-VIRGO
  • MSFG-OPEN
  • INFN-NAPOLI-ARGO NGI IT
  • MD-02-IMINGI MD
Sept 2014 109027 86

18

  • INDIACMS-TIFR, ROC_Asia/Pacific - 109229 - - site solved the issues and the availability/reliability improved
  • KR-UOS-SSCC, ROC_Asia/Pacific - 109230 - issues were solved and the availability and reliability are 100%
  • MY-USM-GCL, ROC_Asia/Pacific - 109231 - site improving in the last month
  • TW-eScience, ROC_Asia/Pacific - 109232 - Situation is ok and improving
  • Taiwan-LCG2, ROC_Asia/Pacific - 109233 - site solved the issues and the availability/reliability is improving
  • BG05-SUGrid, NGI_BG - 109234 - low availability/reliability due to frequent power outage, situation improved
  • CY-01-KIMON, NGI_CYGRID - 109235 - issues with jobs remaining in waiting status forever solved and the availability and reliability for the last 23 days is 100%
  • UNI-DORTMUND, NGI_DE - 109236 - sites will improve in the coming months, last errors are due to "external" SE
  • MSFG-OPEN, NGI_FRANCE - 109237 - site under decommisisoning, migrating to "cloud", no improvement expected for the time being
  • CIEMAT-TIC, NGI_IBERGRID - 109238 - site solved all the issue and is continuously improving
  • INFN-NAPOLI-ARGO, NGI_IT - 109239 - last 30 days behaviour is OK
  • INFN-ROMA1-VIRGO, NGI_IT - 109240 - site solved the issues and availability and reliability results are improving
  • INFN-ROMA2, NGI_IT - 109241 - the site recovered its functionality, the last 30 days availability behaviour is good
  • BCBR, NGI_NL - 109242 - Site improved in the last period, confident the availability & reliability will be above threshold
  • CBPF, ROC_LA - 109243 - site solved all issues and availability & reliability numbers are improving
  • ICN-UNAM, ROC_LA - 109244 - site issues solved and availability & reliability are improving
  • SAMPA, ROC_LA - 109245 - Site solved issues. Availability and reliability are improving in the last days
  • RU-SPbSU, ROC_Russia - 109246 - Site solved the issues. Availability and reliability are improving
0 0 0
June 2014 106738 66

16

  • KR-KISTI-GCRT-01, ROC_Asia/Pacific - 107135 - sit. improved starting with 18.07
  • MY-UTM-GRID, ROC_Asia/Pacific - 107136 - 0% ava/rel since 13.07, site unresponsive
  • PK-CIIT, ROC_Asia/Pacific - 107137 - 0% ava/rel in the last 30 days, site complains hardware pb, upgrads of services to UMD3
  • BG01-IPP, NGI_BG - 107130 - many days with 0% ava/rel and no downtimes
  • BG08-MADARA, BG01-IPP - 107105 - NGI recommends suspension, but sit. improved a lot during last 30days
  • CY-01-KIMON, NGI_CYGRID - [1] - problems using proxy server myproxy.hellasgrid.gr, sit. improved (100% ava/rel) in the last 5 days
  • SCAI, NGI_DE - 107120 - site had to update CAs, but the sit. didn't improve
  • BRGM-ORLEANS, NGI_FRANCE - 107108 - the site is being decommissioned
  • CFP-IST, NGI_IBERGRID - 107109 - problems with their physical infrastructure at the university, which took long time to solve
  • IEETA, NGI_IBERGRID - 107123 - Handover & overload of new responsible. Since the mid of July, the site is performing well again
  • INFN-CAGLIARI, INFN_IT - 107124 - several failures in the past months, sit. unstable in the last month
  • INFN-LECCE, INFN_IT - 107125 - reinstalltion of services on new hardware, sit. improved in the last month
  • RO-09-UTCN, NGI_RO - 107113 - still 0% ava/rel, no improvement, unresponsive site
  • RO-11-NIPNE, NGI_RO - 107114 - problem with old CAs, sit. improved in the last month
  • RO-15-NIPNE, NGI_RO - 107115 - unresponsive site, sit. improved in the last month
  • ATLAND, ROC_LA - 107116 - network problems, sit. still unstable
x 4
  • INFN-CAGLIARI, INFN_IT - 107124 - several failures in the past months, sit. unstable also in July
  • MY-UTM-GRID, ROC_Asia/Pacific - 107135 - low ava/rel (0%) continued also in the following months
  • RO-09-UTCN, NGI_RO - 107113 - 0% ava/rel, sites has problems with its CREAM CE
  • PK-CIIT, ROC_Asia/Pacific - 107137 - 0% ava/rel
0
April 2014 [2] 26

1

0 0 0
March 2014 101947 23 4 1 TH-HAII 1 TH-HAII 0
February 2014 101091 5 5 0 0 0
January 2014 99508 35 5
December 2013 100118 3 3 1 1
  • ru-PNPI (ROC RUSSIA), site unresponsive, 3 consecutive months below target (av/rel), suspended by COD, 99709
November 2013 98976 30 3 0 2
October 2013 97923 2 2

2

2
September 2013 97162 51 7
  • MY-MIMOS-GC-01 97189 no reply from site, suspended by ROD
  • VN-IFI 97186 network problems, suspended by ROD
  • GRISU-COMETA-IAF-CT 97185 suspended by ROD
  • SAMPA 97187 SAM nagios issues
  • KR-UOS-SSCC 97188 site OK now
  • INDIACMS-TIFR 97190 site OK now
  • UA_BITP_ARC 97191 bug in arc info system. site OK now.
0 3
  • MY-MIMOS-GC-01 97189 no reply from site, suspended by ROD
  • VN-IFI 97186 network problems, suspended by ROD
  • GRISU-COMETA-IAF-CT 97185 suspended by ROD

July 2013 96362 34 5
  • MY-MIMOS-GC-01 96498 no reply from site, suspended by ROD
  • ROC_LA/CBPF 96499 problem with publishing data from nagios. no suspension, to be monitored
  • ROC_LA/EELA-UTFSM 96500 problem with publishing data from nagios. no suspension, to be monitored
  • ROC_LA/ICN-UNAM 96501 problem with publishing data from nagios. no suspension, to be monitored
  • ROC_LA/SAMPA 96502 problem with publishing data from nagios. no suspension, to be monitored
1


  • MY-MIMOS-GC-01 96498 no reply from site, suspended by ROD



June 2013 95609 51 5
  • NGI_BG/BG01-IPP 95834 persistent file system issues. Hope to fix this soon. No suspension.
  • NGI_IT/GRISU-COMETA-INFN-CT 95835
  • NGI_IT/INFN-ROMA1-VIRGO 95836 Problem with the statistics. No suspension.
  • ROC_LA/ICN-UNAM 95837 Regional nagios issues. Installed a new one. No suspension.
  • ROC_LA/SAMPA 95838 Regional nagios issues. Installed a new one. No suspension.
N/A 1
  • NGI_IT/GRISU-COMETA-INFN-CT 95835 Suspended by ROD
May 2013 94557 27

1

  • ROC_LA ICN-UNAM 94656Problem with regional Nagios, not at site. Not supending.
April 2013 93826 21

3

  • ROC_Asia/Pacific/INDIACMS-TIFR 94060
  • NGI_IT/GRISU-COMETA-INFN-LNS 94061
  • ROC_LA/ICN-UNAM 94062
N/A 1
  • NGI_IT/GRISU-COMETA-INFN-LNS 94061 suspended by ROD

March 2013 93262 29

3

  • ROC_Asia/Pacific/NZ-UOA 93468 site unresponsive, availability 0 for 3 months. Suspending.
  • ROC_Asia/Pacific/TH-HAII 93469 updated CE node to EMI2 - performance now improved. Not suspending.
  • NGI_RO/RO-02-NIPNE 93470 hardware problems, software upgrades - performance now improved. Not suspending.
N/A

1

ROC_Asia/Pacific/NZ-UOA, site unresponsive, availability 0 for 3 months 93468


February 2013 92264 31

12

  • ROC_Asia/Pacific/TH-HAII 92405 Problems now solved
  • NGI_ARMGRID/AM-04-YERPHI 92406 site came back after suspension
  • NGI_FRANCE/BRGM-ORLEANS 92407 issues upgrading to UMD
  • NGI_IT/INFN-NAPOLI-PAMELA 92408 site came back after suspension
  • NGI_RO/RO-01-NIPNE 92409 site has been offline. problem solved now.
  • NGI_RO/RO-09-UTCN 92410 issues upgrading to UMD
  • NGI_RO/RO-11-NIPN 92411 installation of new hardware. reinstallation of site
  • NGI_UK/UKI-LT2-UCL-HEP 92412 issues upgrading to UMD
  • ROC_Russia/RU-ISA-CGTDC 92413 site has been decommissioned
  • ROC_Russia/RU-SPbSU 92414 A/R will be recomputed
  • ROC_Russia/Ru-Troitsk-INR-LCG2 92415 A/R will be recomputed
  • ROC_Russia/ru-Moscow-SINP-LCG2 92416 Problem now solved
N/A

0


January 2013 91218 45

9

  • NGI/FRANCE/BRGM-ORLEANS 91411
  • NGI_NL/LSG-AMS 91412
  • NGI_RO/RO-09-UTCN 91413 update to UMD-2 + hardware problems
  • NGI_RO/RO-11-NIPNE 91414 major hardware upgrades.
  • NGI_RO/RO-15-NIPNE 91415
  • NGI_TR/TR-03-METU 91416problems with old DPM. New hardware planned.
  • ROC_LA/SAMPA 91417 update to EMI-2 related problems.
  • ROC_Russia/RU-SPbSu 91418 - retirement of old regional Nagios. New instance in place.
  • ROC_Russia/ru-Troitsk-INR-LCG2 91419 - retirement of old regional Nagios. New instance in place.
N/A

1

  • NGI_NL/LSG-AMS 91412 suspended by ROD

December 2012 90185 54

12

  • ROC_Asia/Pacific/MY-MIMOS-GC-01 90187 - Downtime M/W upgrades
  • NGI_HR/egee.irb.hr 90188 - M/W upgrades,H/W replacements
  • NGI_IT/Hephy-Vienna 90189 - M/W upgrades, cooling problems
  • NGI_NL/LSG-AMS 90190 - Issues with NICs
  • NGI_RO/NIHAM 90191 - Nagios issue, recomputation requested.
  • NGI_RO/RO-15-NIPNE 90192 - M/W upgrades, fibre issues
  • ROC_LA/EELA-UNLP 90193 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 90194 - Suspended by ROD
  • ROC_LA/ATLAND 90195 - Suspended by ROD
  • ROC_LA/CBPF 90196 - Nagios issues
  • ROC_LA/ICN-UNAM 90197 - Nagios issues
  • ROC_LA/SAMPA 90198 - Nagios issues
N/A

3

  • ROC_LA/EELA-UNLP 90193 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 90194 - Suspended by ROD
  • ROC_LA/ATLAND 90195 - Suspended by ROD



November 2012 89223 72

13

  • ROC_Asia/Pacific/KR-UOS-SSCC 89224 - nagios issue, recomputation requested
  • ROC_Asia/Pacific/MY-MIMOS-GC-01 89225 - nagios issue, recomputation requested
  • NGI_BG/BG05-SUGrid 89227 - series of issues, now fixed
  • NGI_BG/BG02-IM 89227 - suspended by ROD
  • NGI_RO/NIHAM 89230 - nagios issue, recomputation requested
  • ROC_LA/EELA-UNLP 89231 - nagios issues
  • ROC_LA/UFRJ-IF 89232 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 89233 - suspended by ROD
  • ROC_LA/ATLAND 89234 - nagios issues
  • ROC_LA/CBPF 89235 - nagios issues
  • ROC_LA/ICN-UNAM 89236 - nagios issues
  • ROC_LA/SAMPA 89237 - nagios issues
  • ROC_LA/SUPERCOMPUTO-UNAM 89238 - Suspended by ROD
N/A

4

  • NGI_BG/BG02-IM 89227 - suspended by ROD
  • ROC_LA/UFRJ-IF 89232 - Suspended by ROD
  • ROC_LA/ULA-MERIDA 89233 - suspended by ROD
  • ROC_LA/SUPERCOMPUTO-UNAM 89238 - Suspended by ROD

September 2012 86871 51

8

  • ROC_Asia/Pacific/PK-CIIT 86803 - Suspended by COD due to lack of improvement.
  • NGI_BG/BG08-MADARA 86804 Suspended by COD due to lack of improvement.
  • NGI_IT/AREA-BO 86805 Not suspended due to good performance this month. NGI claims the problem is solved.
  • NGI_NL/LSG-LUMC 86807 Not suspended due to good performance this month. NGI claims the problem is solved.
  • NGI_IT/GRISU-COMETA-UNICT-DMI 86806 - Suspended by COD due to lack of improvement.
  • NGI_RO/NIHAM 86808 Not suspended due to rise of availability since 22.10.
  • ROC_IGALC/UFCG-LSD 86809 Already decommissioned.
  • ROC_LA/UFRJ-IF 87233 Not suspended due to move to ROC_LA and central Nagios problem - to be checked next month.
0 3
  • ROC_Asia/Pacific/PK-CIIT 86803
  • NGI_BG/BG08-MADARA 86804
  • NGI_IT/GRISU-COMETA-UNICT-DMI 86806

August 2012 85774 44

2

  • BG08-MADARA, NGI_BG, ..., 85843 Infrastructural problems who are close to being fixed
  • BY-BNTU, NGI_BY, ..., 85844 site suspended by ROD
0 0
July 2012 84771 30

4

  • INDIACMS-TIFR, AsiaPacific, suspended by ROC, 84860
  • CFP-IST, NGI_IBERGRID, not suspended: hardware problems resolved, site performed above the threshold during last month, 84861
  • ICN-UNAM, ROC_LA, not suspended: they probably overcome problems, if situation occures again, suspend in September 84863
  • TR-05-BOUN, NGI_TR, suspended by ROC, 84862
0 0
June 2012 83902 40 1
  • MY-UUM-SINTOK, AsiaPacific, low availability for three months, suspended by NGI 84075

3

  • NGI_BG/BG08-MADARA, 84044
  • NGI_BG/BG06-GPHI, 84043
  • ROC_Russia/RU-SPbSU, 84074
0
May 2012 82784 33 0
0 0

April 2012 81843 34 3
  • CEFET-RJ, ROC_IGALC, low availability for three months, suspended by ROC, 81974
  • ID-ITB, AsiaPacific, low availability for three months suspended by ROC, 81972
  • PH-ASTI-LIKNAYAN AsiaPacific, low availability for three months, site was suspended but set to certified in order to recertify it, 81973
0 0
March 2012 80900 25

2

  • BY-BNTU, NGI_BY, not suspended because it achieves 86% in April, 81086
  • AM-05-YSU, NGI_ARMENIA, site suspended by ROC, 81087
0


February 2012 79844 35 3 0 2 suspended by ROC
  • IN-DAE-VECC-02, 80037
  • MY-UM-CRYSTAL, 80038
0
January 2012 79020 30 3 0
0
1; GARR-01-DIR, NGI_IT due to issue (EGI-20120116-01) recorded in RTIR ticket #3300)
December 2011 78040 28 3 0
0
0
November 2011 77170 26 3 0
0
0
October 2011 76305 45 3

3

  • ROC_Russia/RRC-KI, 76408
  • NGI_IL/WEIZMANN-LCG2, 76386
  • NGI_IL/TECHNION-HEP, 76384

1

  • NGI_ARMGRID/AM-04-YERPHI,76428
0
September 2011 74965 25 6 0

2

  • ROC_Asia/Pacific/TW-NCUHEP, 3 consecutive months below the target (site suspended by ROC), #75041
  • ROC_Asia/Pacific/IN-DAE-VECC-02, 4 consecutive months below the target (site suspended by COD), #75040
0
August 2011 74147 31 3 0 0 0
July 2011 73193 35 4 0

2

  • ROC_Asia/Pacific/TW-eScience, 3 consecutive months below the target(site suspended by ROC), ticket #73539
  • ROC_Asia/Pacific/TW-NYMU-GRID,3 consecutive months below the target(site suspended by ROC), ticket #73540
0
June 2011 72259 31 8 0 5
  • ROC_Asia, Pacific/PH-ASTI-LIKNAYAN, 3 consecutive months below target, ticket #72435
  • ROC_Asia, Pacific/TH-HAI, 3 consecutive months below target, #72436
  • NGI_IBERGRID, UOGRID, 3 consecutive months below target, #72439
  • ROC_IGALC, CEFET-RJ, 3 consecutive months below target, #72440
  • ROC_IGALC, LA-MERIDA, 3 consecutive months below target, #72441
0
May 2011 71643 23

4

this month first time targed was changed to ava 70% rel 75%

0 2
  • UOGRID, NGI_IBERGRID, below 50%/50% for three months, 71787
  • PH-ASTI-LIKNAYAN, ROC_Asia/Pacific, below 50%/50% for three months(site suspended by ROC), 71789 
0
April 2011 70289 39 0 0 0 0
March 2011 69629 30 0

1

  • ru-Moscow-SINP-LCG2, ROC_Russia, 69765
0 0
February 2011 68299 / See also 68229 28 0 1
  • RU-SPbSU, ROC_Russia

0
Jan 2011 67008 27 2 1
  • ru-IMPB-LCG2,ROC_Russia, 67038
2
  • MY-UM-PANG5, ROC_Asia/Pacific, below 50%/50% for three months(site suspended by ROC), 67010
  • ru-IMPB-LCG2,ROC_Russia, sites didn't provided explanation,67038
0
Dec 2010 65971 30 0 0 0 0
Nov 2010 64892 40 1 0 1
  • ID-ITB, ROC_Asia/Pacific,below 50%/50% for three months (site suspended by ROC),64951
0
Oct 2010 63658 37 4 1
  • AM-04-YERPHI, NGI_ARMGRID, 63854
2
  • AM-01-IIAP-NAS-RA, NGI_ARMGRID, below 50%/50% for three months, 63837
  • AM-04-YERPHI, NGI_ARMGRID, no explanation given, 63854
0
Sep 2010 62853 39 2 0 0 0
Aug 2010 61797 43 3 10
  • JP-HIROSHIMA-WLCG, ROC AP, 62323
  • AU-PPS, ROC AP,62316
  • MY-UTM-GRID, ROC AP, 62312
  • TH-NECTEC-LSR, ROC AP, 62304
  • MY-UM-CRYSTAL, ROC AP, 62300
  • ID-ITB, ROC AP, 62296
  • GRISU-COMETA-INFN-LNS, ROC Italy, 62314
  • UNIGE-DPNC, NGI_NDGF, 62308
  • ru-IMPB-LCG2, ROC_Russia, 62305
  • IL-TAU-HEP,ROC_SE, 62306
1
  • TW-NCUHEP, ROC AP, below 50%/50% for three months, 62340
0
Jul 2010 61115 47 4 0 0 0
Jun 2010 60216 59 4 0 0 0
May 2010 59736 38 3 1
  • ru-Chernogolovka-IPCP-LCG2, ROC RU, 59819
0 0


Personal tools
Namespaces
Variants
Actions
Navigation
Toolbox
Print/export