Operations
Jump to navigation
Jump to search
EGI Operations
Contact information
You are welcome to get in contact with EGI Operations via e-mail!
Contact address: operations at egi.eu
Overview of EGI-InSPIRE SA1
Tasks, activities and coordinators | Deliverables and Milestones | EGI-InSPIRE SA1 Quality Metrics
General information
Infrastructure and Resource Providers
- Full list resource providers who are contributing resources to the EGI infrastructure.
- Join the infrastructure. Step-by-step instructions for a Grid site to become part of the EGI infrastructure.
Operations boards and agendas
- Operational Tools: Operational Tool Advisory Group (OTAG) | Operations Automation Team (OAT) | EGI-InSPIRE JRA1
- Security: EGI CSIRT
- EGI-InSPIRE SA1: EGI-InSPIRE SA1 meetings
Task forces
- Operational Level Agreement (OLA) Task Force - October 2010. Purpose of the OLA task force is to propose a plan for an extension and further development of the current OLA framework of EGI. The plan, expected for end of November 2010, will be used as input for discussion at the OMB.
- Meetings | Milestone MS404 (Operational Level Agreements (OLAs) within the EGI production infrastructure)
Interoperability of EGI operations
Operational security
Security:Main Page | EGI Computer Security and Incident Response Team | EGI Software Vulnerability Group
Middleware rollout to production infrastructure
This section describes the process and procedures to deploy new versions of Grid middleware components into the EGI production infrastructure.
Distribution process
- Middleware:Release_Process
- Deploying Software into the EGI production infrastructure, Milestone MS402
- EGI IGTF Release Process
Supported middleware
- WLCG baseline clients and services
- Supported versions of gLite Clients (to be updated)
- Supported versions of gLite Services (to be updated)
- Procedure for retiring middleware services (to be updated)
- Processes for Maintaining and Enforcing a List of Supported gLite Middleware Service
Tools
- Operational tools: operations dashboard | Operations portal | GOCDB (Grid configuration repository) | Gstat 2.0 | MyEGI portal
- Network monitoring: Network monitoring tools (temporary link)
- Accounting: Accounting portal | Accounting enforcement
- Availability: GridView availability graphs | Monthly availability: historical reports | Availability Excel Reports
- Support: EGI Helpdesk
- Metrics: Metrics portal
Tool documentation
- Documentation on operational tools
- Status of ROC/NGI Nagios services
- EGEE-III OAT wiki
Daily Operations, Support and Documentation
- Accounting
- Monitoring infrastructure
- Critical SAM/Nagios Probes
- SAM probes (draft, will be release in Nov 2010, and replaces https://twiki.cern.ch/twiki/bin/view/LCG/SAMProbesMetrics)
- Monitoring uncertified sites with Nagios
- Aggregated Topology Provider (ATP)
- EGI Helpdesk
- GGUS ticket timeline tool
- GGUS report generator
- GGUS escalation reports
- GGUS documentation (including information on team and alarm tickets)
- EGEE-III User Support Advisory Group
- Grid operations oversight
- Operations news archive
- Availability/reliability monthly statistics and suspended sites
Procedures and best practices
Documentation
ARC
gLite
- gLite latest release
- gLite User guide and general docuementation
- EGEE -III MPI WG Recommendations and Improvements for the current MPI Implementation and Ideas for Extension, final version 1.0
UNICORE
- Download server & central services: Core Server | Workflow System | Common Information Service
- Download clients: Rich Client | Command Line Client | HiLA API
- Documentation
Tools
- GGUS documentation and tutorials (including information on alarm and team tickets)
- Documentation on EGI operational tools
- Troubleshooting with GStat 2.0
Operational Level Agreements
- Operational Level Agreement between a NGI and a site
Resources
EGI-InSPIRE presentation template | EGI-InSPIRE document template | EGI-InSPIRE quarterly report template