Difference between revisions of "Operations"
Revision as of 13:48, 21 September 2010
EGI Operations
Overview of EGI-InSPIRE SA1
General information
Infrastructure and Resource Providers
- Full list of resource providers contributing resources to the EGI infrastructure.
- Join the infrastructure. Step-by-step instructions for a Grid site to become part of the EGI infrastructure.
Operations boards and agendas
- Operations
- Operational Tools
- Security: EGI CSIRT
- EGI-InSPIRE SA1: EGI-InSPIRE SA1 meetings
- Task forces
- TBD
SA1 contact information
To be completed.
Interoperability of EGI operations
- NGIs and other Grids
- UNICORE 6 Nagios sensors implementation
- EGEE-III WLCG Interoperations
Operational security
Middleware rollout to production infrastructure
This section describes the process and procedures to deploy new versions of Grid middleware components into the EGI production infrastructure.
Distribution process
- Middleware:Release_Process
- Deploying Software into the EGI production infrastructure, Milestone MS402
- EGI IGTF Release Process
Supported middleware
- WLCG baseline clients and services
- Supported versions of gLite Clients (to be updated)
- Supported versions of gLite Services (to be updated)
- Procedure for retiring middleware services (to be updated)
- Processes for Maintaining and Enforcing a List of Supported gLite Middleware Service
Tools
- Operational tools
- Operations dashboard
- CIC Operations portal
- GOCDB (Grid configuration repository)
- Gstat 2.0
- MyEGI portal
- Network monitoring
- Network monitoring tools (temporary link)
- Accounting
- Availability
- GridView availability graphs
- Monthly availability: historical reports
- Availability Excel Reports
- Support
- EGI Helpdesk
Tool documentation
- Documentation on operational tools
- Status of ROC/NGI Nagios services
- EGEE-III OAT wiki
Daily Operations, Support and Documentation
- Accounting
- Monitoring infrastructure
- SAM probes and metrics
- Monitoring uncertified sites with Nagios
- Aggregated Topology Provider (ATP)
- EGI Helpdesk
- GGUS ticket timeline tool
- GGUS report generator
- GGUS escalation reports
- GGUS documentation (including information on team and alarm tickets)
- EGEE-III User Support Advisory Group
- Grid operations oversight
- Operations news archive
- Availability/reliability monthly statistics and suspended sites
Procedures and best practices
Documentation
ARC
gLite
- gLite latest release
- gLite 3.2
- gLite 3.1
- gLite Documentation
- EGEE-III MPI WG Recommendations and Improvements for the current MPI Implementation and Ideas for Extension, final version 1.0
UNICORE
Tools
- GGUS documentation and tutorials (including information on alarm and team tickets)
- Documentation on EGI operational tools
- Troubleshooting with GStat 2.0
Operational Level Agreements
- Operational Level Agreement between an NGI and a site
Resources
- EGI-InSPIRE presentation template
- EGI-InSPIRE document template
- EGI-InSPIRE quarterly report template