EGI-InSPIRE:Ibergrid-QR4
Quarterly Report Number | NGI Name | Partner Name | Author |
---|---|---|---|
QR4 | Ibergrid | LIP & CSIC | G. Borges (LIP) |
1. MEETINGS AND DISSEMINATION
1.1. CONFERENCES/WORKSHOPS ORGANISED
Date | Location | Title | Participants | Outcome (Short report & Indico URL) |
---|---|---|---|---|
<Date> | <Location> | <Title> | <Participants> | <Outcome> |
1.2. OTHER CONFERENCES/WORKSHOPS ATTENDED
Date | Location | Title | Participants | Outcome (Short report & Indico URL) |
---|---|---|---|---|
11-14 April | Vilnius, Lithuania | EGI User Forum 2011 | 3 | Scientific workflows in Kepler -hands on; HUC/VRC training event |
14-16 April | Ljubljiana, Slovenia | ICANNGA, LNCS (Springer) | 1 | Paper Accepted: Sensitiveness of Evolutionary Algorithms to the Random Number Generator |
1.3. PUBLICATIONS
Publication title | Journal / Proceedings title | Journal references Volume number Issue Pages from - to |
Authors 1. 2. 3. Et al? |
---|
2. ACTIVITY REPORT
2.1. Progress Summary
- The IBERGRID r-Nagios service has been configured to use Ibergrid core services in a failover. The only service for which there is not a failover mechanism is for MyProxy service.
- Start enforcing the support on regional macro VOs. The policy is that sites in the region should provide at least the 20% of their resources to the regional macro VOs indicated as a payment of the international services provided by the NGI. If it is not possible (if the local infrastructure is owned by dedicated projects) sites should select at least one of these macro VOs and provide opportunistic access. Simultaneously the decomission of old regional VOs started.
- Regional VOMS replication scheme implementation in progress. The aim is to keeping DB copies of a main host (MASTER) in a secondary host (SLAVE). By architecture, the slave host will only have read access to the Database entries.
- Successfull upgrades on the regional operation dashboard and on the regional nagios.
- Enforcement of the phase out of EGEE name in the information system.
- Internal discussion of the Resource Centre OLA (ongoing).
- Produce internal documentation on accounting, benchmarking and on gLite-Cluster node.
- Following a request from WLCG, we have developed a schema to support TopBDIIs in high availability mode.
- Contribution on a best effort basis to the Certification Manual documentation.
- Several IBERGRID sites contributed to the Stage Rollout process.
- Completion of the Survey for supported OS platforms, LB, service management and monitoring (for EMI components).
- Provide input to the list of components from EMI 1.0 that should be integrated in UMD with high priority.
- Both Portugal and Spain have provided self-assessment input of NGI international tasks for MS109.
- Provide feedback to deliverable D4.2 - Annual Report on the EGI Production Infrastructure
- Review of the deliverables D5.2 - Annual Report on the status of the software provisioning activity and the work of DMSU; and D6.3 - Annual Report on the Tools and Services of the Heavy User Communities
- Ibergrid ROD team was invited to share their support model and experience with regional tools during the ROD session in EGI User Forum (https://www.egi.eu/indico/contributionDisplay.py?sessionId=9&contribId=210&confId=207)
2.2. Main Achievements
- Invitation to report the IBERGRID operational model in the ROD session in EGI User Forum (https://www.egi.eu/indico/contributionDisplay.py?sessionId=9&contribId=210&confId=207)
- Assessement of the high availability schema for the regional TopBDIIs
- Successfull upgrades on the regional operation dashboard and on the regional nagios.
- Stage Rollout participation
2.3. Issues and mitigation
Issue Description | Mitigation Description |
---|---|
MyProxy represents a single point of failure for the regional nagios service | We were investigating the possibility of using Robot certificates for the IBERGRID r-Nagios |
Identified timeout problem with VOMS | Open GGUS ticket #69356 - VOMS support recognized the issue has a bug |
Sync problems between regional and central dashboard | GGUS #68414 Ticket opened and assigned to COD |
Notification problem bewtween r-NAGIOS and Operational Portal | GGUS #68414 Ticket opened and assigned to COD |
Spanish sites had a GÉANT network issue (since 13:30 PM to 18:00 PM on the 03/17/2011) with consequences in availability/reliability metrics | GGUS ticket #68848 was open in order to do not take into account that period in the A/R metrics. |