- Mailinglist: email@example.com (managed by Victor Hazlewood firstname.lastname@example.org)
- Meetings: EGI-CF 2013, Telcon, XSEDE13-BoF, EGI TF 2013 + minutes (doc)
The goal of this collaboration is to identify and exchange best practices and solutions between the XSEDE and EGI e-infrastructures so they can operate more efficiently to serve scientists in the U.S. and Europe. The collaboration is focussed on four areas of work:
- Operation of e-infrastructure services
- Cloud services for science and education
- Champions to engage with new users
- User support and joint use cases
Each area has a named contact point from EGI and from XSEDE. Further details about the work under these four areas are provided below.
Background information on XSEDE
The Extreme Science and Engineering Discovery Environment (XSEDE) is the most advanced, powerful, and robust collection of integrated advanced digital resources and services in the world. It is a single virtual system that scientists can use to interactively share computing resources, data, and expertise. XSEDE is a five-year, $121-million project supported by the National Science Foundation. It replaces and expands on the NSF TeraGrid project.
Background information on EGI
The European Grid Infrastructure (EGI) delivers integrated computing services to European researchers, driving innovation and enabling new solutions to answer the big questions of tomorrow. EGI is a federation of over 340 resource centres, set up to provide computing services and resources to European researchers and their international collaborators. EGI supports research collaborations of all sizes: from the large teams behind the Large Hadron Collider at CERN and Research Infrastructures in the ESFRI roadmap, to the individuals and small research groups that equally contribute to innovation in Europe.
Area 1: Operations
- EGI: Tiziana Ferrari <email@example.com>, Malgorzata Krakowian <firstname.lastname@example.org>
- XSEDE: Victor Hazlewood <email@example.com>
- Organisational benchmarking: compare operational processes and services so that we can identify good practices and learn them from each other
- Helpdesk and ticket procedures (e.g. Escalation processes)
- Resource monitoring
- Use the multi-infrastructure use cases to decide what should be changed in the infrastructures and on which side (Compchem and WeNMR)
- Integrate helpdesk and accounting to support communities that will in future jointly use XSEDE and EGI, and involve OSG for communities such as WeNMR that will also consume OSG resources
- Accounting (17.09.2013) on Rob Quick: OSG will share with XSEDE information on how they publish accounting data to EGI
- [DONE] Support (17.09.2013) on Małgorzata Krakowian: send EGI Helpdesk team contact to Victor Hazlewood
- Support (17.09.2013) on Alessandro Costantini: review whether the WeNMR ticketing workflow meets Compchem requirements
- Support (17.09.2013) on Alessandro Costantini and Marco Verlato: describe the ticketing use case (tools + procedures) now in place
- Authentication (17.09.2013) on TBD: identify the IGTF CAs that XSEDE needs to authorize
- Authentication (17.09.2013) on TBD (XSEDE): get the XSEDE Operations Security team to add the identified CAs to the XSEDE accepted CAs list
- The EGI Helpdesk is GGUS. It is based on Remedy. User authentication is based on X.509 certificates, and access through Shibboleth is being implemented. All users with a valid certificate issued by an IGTF CA have read access to all tickets. GGUS already supports an interface to RT, as various National Grid Initiatives use RT as their local helpdesk system: GGUS Interfaces
- The GGUS Report Generator is the system we use to collect statistics such as usage, distribution of tickets, and time to respond to and solve tickets.
- Plenty of GGUS documentation is available on-line: GGUS Documentation
- Escalation procedures. Various procedures are in place depending on the type of issue to be escalated. A few examples:
The EGI accounting infrastructure is distributed, with a central accounting database that gathers usage records either directly from individual resource centres or from national databases. The central accounting database is based on APEL. Accounting records are collected for computing jobs that complete successfully (status DONE). Storage and cloud accounting are being tested and will be rolled out to production in 2013.
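As a rough illustration of the collection model described above, the following sketch merges usage records that arrive either directly from resource centres or via national databases and sums them per site. The record fields (`site`, `source`, `status`, `cpu_hours`) and the function name `aggregate_cpu_hours` are hypothetical and do not reflect the APEL record format; the only behaviour taken from the text is that jobs are counted only when their status is DONE.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class UsageRecord:
    # Hypothetical fields for illustration only; not the APEL schema.
    site: str         # resource centre that ran the job
    source: str       # "direct" or "national": the path the record took
    status: str       # final job status
    cpu_hours: float  # CPU time consumed by the job


def aggregate_cpu_hours(records):
    """Sum CPU hours per site, counting only jobs that finished as DONE."""
    totals = defaultdict(float)
    for rec in records:
        if rec.status == "DONE":
            totals[rec.site] += rec.cpu_hours
    return dict(totals)


records = [
    UsageRecord("site-a", "direct", "DONE", 12.5),
    UsageRecord("site-a", "national", "DONE", 7.5),
    UsageRecord("site-b", "national", "FAILED", 3.0),
    UsageRecord("site-b", "direct", "DONE", 4.0),
]
print(aggregate_cpu_hours(records))
```

The central database in this sketch does not care which path a record took; the `source` field is kept only so that the two collection routes remain visible in the data.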
EGI monitoring is distributed and is based on Service Availability Monitoring (SAM). Monitoring data is gathered centrally to provide access to historical information and to compute performance indicators (availability and reliability). SAM is installed by individual National Grid Initiatives, and monitoring data is exchanged through messaging (ActiveMQ).
- EGI Monitoring information can be consulted via MyEGI.
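The page does not state how availability and reliability are computed. The sketch below assumes one commonly used pair of definitions: availability is the fraction of time with known status in which the service was up, and reliability additionally excludes scheduled downtime from the denominator. The status labels and the function name are hypothetical, and the formulas should be checked against the actual SAM documentation before being relied on.

```python
from collections import Counter


def availability_and_reliability(samples):
    """Compute (availability, reliability) from equally spaced status samples.

    Assumed definitions (not taken from this page):
      availability = up / (total - unknown)
      reliability  = up / (total - unknown - scheduled downtime)
    Returns None for an indicator whose denominator would be zero.
    """
    counts = Counter(samples)
    up = counts["UP"]
    known = len(samples) - counts["UNKNOWN"]
    unscheduled_known = known - counts["SCHEDULED_DOWN"]
    availability = up / known if known else None
    reliability = up / unscheduled_known if unscheduled_known else None
    return availability, reliability


# Twelve hypothetical hourly samples for one service.
samples = ["UP"] * 6 + ["DOWN"] * 2 + ["SCHEDULED_DOWN"] * 2 + ["UNKNOWN"] * 2
print(availability_and_reliability(samples))
```

Under these assumed definitions, reliability is never lower than availability, because announced maintenance windows are not counted against the site.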
Policies and procedures
- Contacts: Ken Hackworth, XSEDE Allocations Manager; Dave Hart, POPS
XSEDE's current allocations system, called "POPS", already supports allocations of storage resources. In fact, POPS is general enough to allow any kind of resource (and associated "billing unit") to be specified for allocations.
"POPS 2.0" is a re-engineering of the POPS system that aims to disentangle it from some legacy decisions, rework legacy code that dates back to the late 1990s (along with 15+ years of incremental changes), and better integrate it with other components of the XSEDE infrastructure. As part of that process we are designing POPS as "allocations software as a service", so that non-XSEDE client organizations could spin up their own allocations process within the XSEDE service with minimal effort and investment. We're still fairly early in the design and implementation stage, but if EGI might be interested in being a 'customer' of this XSEDE service, we should certainly talk.
- Tiziana Ferrari, Malgorzata Krakowian/EGI.eu
Area 2: Cloud
- EGI: David Wallom <firstname.lastname@example.org>
- XSEDE: David Lifka <email@example.com>
- VMI preparation – collaboration on creating endorsed images of common software
- Identify questions that the technical support teams will have to be able to answer
- Hybrid cloud setup – an internal cloud that is kept busy, bursting some load out to external clouds when needed.
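The hybrid cloud idea in the last item can be sketched as a simple placement rule: fill the internal cloud first and send only the overflow to an external provider. The function name, the use of core counts as the capacity measure, and the first-fit policy are all assumptions made for illustration; the page does not prescribe a scheduling policy.

```python
def place_jobs(job_cores, internal_capacity):
    """Place jobs on the internal cloud first; overflow goes external.

    job_cores: core count requested by each job, in arrival order.
    internal_capacity: free cores on the internal cloud (assumed unit).
    Returns (internal_jobs, external_jobs) as lists of core counts.
    """
    internal, external = [], []
    free = internal_capacity
    for cores in job_cores:
        if cores <= free:
            # Job fits on the internal cloud: keep local resources busy.
            internal.append(cores)
            free -= cores
        else:
            # Not enough local capacity: burst this job to an external cloud.
            external.append(cores)
    return internal, external


internal, external = place_jobs([4, 8, 2, 16, 1], internal_capacity=12)
print("internal:", internal)
print("external:", external)
```

A real setup would also release capacity as jobs finish and weigh cost or data locality before bursting; this sketch only shows the "internal first, external when full" decision.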
Area 3: Champions
- EGI: Sara Coelho <firstname.lastname@example.org>
- XSEDE: Kay Hunt <email@example.com>
Area 4: User support
- EGI: Gergely Sipos <firstname.lastname@example.org>
- XSEDE: Sergiu Sanielevici <email@example.com>, Suresh Marru <firstname.lastname@example.org>
- Support the implementation of the use cases that have been submitted to the 'Collaborative Use Examples' call
- Computational Chemistry use case
- WeNMR use case
- Identify science gateways that would benefit from resources from XSEDE and EGI, and facilitate such integration activities.
- Exchange information, best practices and tools from the user support and technical outreach areas.