Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "EGI-InSPIRE:Ireland-QR3"

From EGIWiki
Jump to navigation Jump to search
Line 99: Line 99:
  * Record number of CPU hours recorded on during all three months at csTCDie, csUCCie and scgNUIGie.
  * Record number of CPU hours recorded on during all three months at csTCDie, csUCCie and scgNUIGie.
  * Support more international VOs at csUCCie (biomed) and NUIG (astro).
  * Support more international VOs at csUCCie (biomed) and NUIG (astro).
  * Availability and Reliability statistics for November and December were excellent (despite miscellaneous security updates and subsequent deployment)
  * Availability and Reliability statistics for November and December were excellent (despite miscellaneous security updates and subsequent deployment). January figures are not yet available.


===2.3. Issues and mitigation===
===2.3. Issues and mitigation===

Revision as of 17:26, 2 February 2011


Quarterly Report Number NGI Name Partner Name Author
QR3 NGI-IE Ireland John Walsh


1. MEETINGS AND DISSEMINATION

GENERAL GUIDELINES FOR ALL EVENTS REPORTED IN THE FOLLOWING SECTIONS:
  • please do not provide a list of participants, only give the number of people that attended
  • for outcome, please list tangible agreements, decisions instead of listing program points or presentations you made. Otherwise put: “-“
  • include your local events only if there was any EGI-related topic on the agenda
  • provide an indico URL to your presentation (if available) or to the event itself.
    • If your presentation is not available online, please send the slides to erika.swiderski@egi.eu.

Note: Complete the tables below by adding as many rows as needed. Note: Complete the tables below by adding as many rows as needed.

1.1. CONFERENCES/WORKSHOPS ORGANISED

Date Location Title Participants Outcome (Short report & Indico URL)
<Date> <Location> <Title> <Participants> <Outcome>
<Date> <Location> <Title> <Participants> <Outcome>
<Date> <Location> <Title> <Participants> Test

1.2. OTHER CONFERENCES/WORKSHOPS ATTENDED

Date Location Title Participants Outcome (Short report & Indico URL)
<Date> <Location> Title Participants Outcome
<Date> <Location> Title Participants Outcome
<Date> <Location> Title Participants Outcome Test


1.3. PUBLICATIONS

Publication title Journal / Proceedings title Journal references
Volume number
Issue

Pages from - to
Authors
1.
2.
3.
Et al?
<Publication title> <Journal/Proceedings> Vol:<volume number>
Issue:<Issue>
Pg: <from> - <to>
1.<Author 1>
2.<Author2>
3. <Author3>
<Publication title> <Journal/Proceedings> Vol:<volume number>
Issue:<Issue>
Pg: <from> - <to>
1.<Author 1>
2.<Author2>
3. <Author3>
<Publication title> <Journal/Proceedings> Vol:<volume number>
Issue:<Issue>
Pg: <from> - <to>
1.<Author 1>
2.<Author2>
3. <Author3>

2. ACTIVITY REPORT

2.1. Progress Summary

* Fully commissioned a redundant 10Gb link (with failover) from OpsCentre to Main Campus to NREN.
* Grid-Ireland started the process of creating the operational structure NGI_IE.
* Deployment of National UI (gLite 3.2)


2.2. Main Achievements

* Record number of CPU hours recorded on during all three months at csTCDie, csUCCie and scgNUIGie.
* Support more international VOs at csUCCie (biomed) and NUIG (astro).
* Availability and Reliability statistics for November and December were excellent (despite miscellaneous security updates and subsequent deployment). January figures are not yet available.

2.3. Issues and mitigation

Issue Description Mitigation Description
Power issues (overloading of UPS) at scgNUIGie in January required shutdown of 12 dual-core nodes Have deployed an additional 24-core SMP unit
NFS issues at csTCDie (vendor supplied solution related) resulted in long running jobs failing at last stage of job management split load over 4 servers, increased default timeouts and retries, forced restart of lower level services
Weaknessess in globus cleanup script (over NFS) Writing script to clean old files
csTCDie is still waiting on production ready glite-CLUSTER node type to integrate Ubuntu and PS3 nodes
VO dashboard information is sometimes not updated. Quattor sites depend on up-to-date information, thus recent issues seen with move of DTEAM to NGI-GR
Detected problem with Nagios MPI sensor at csUCCie Problem fixed by EGI Nagios tools group