User:Pslizik/Structure
Operations
- Best Practices
- Best Practices
- Best Practices/Proposed
- Training Guides
- Dashboard Howto
- Tool Collection
- Ops Portal
- VO Dashboard
- GGUS
- GOCDB
- Regional Tier 2 Tool (Peter: What's this?)
- Nagios
- Pakiti
- Cacti
- OSSEC
- EGI Wiki (as a tool)
- Operation Manuals
- Procedures
- COD Escalation Procedure
- Creation and validation of a new Operations Centre
- Operations Centre Decommission
- How to validate a ROC or NGI Nagios box
- Setting a Nagios test as an operations test
- EGI CSIRT: Incident reporting
- How to publish Site Information
- HEP SPEC06
- Procedures for NGIs and Sites
- Getting started
- Site administrator's duties
- 1st-line support
- ROD's duties
- COD's duties (=> not our business, anyway)
- GOCDB
- Introducing a new site
- Site downtime scheduling
- Removing problematic sites
- Removing unused sites
- Removing resources
- Intervention procedures
- Incident reporting
- Tools
- GOCDB
- Nagios
- MyEGI
- GIIS
- GGUS
- Procedures
- Howto declare a downtime
- Howto contact COD
- Howto suspend a site
- Howto respond to incidents
- Howto create an NGI (this one exists and has been approved by OMB: https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
- How does ROD interact with a site
- Howto open a ticket in GGUS
Andres':
- Howto declare a downtime
- Howto contact COD
- Howto suspend a site (isn't this rather a procedure?)
- Howto respond to incidents
- Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
- Howto set a Nagios test as an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)
- Howto register and certify a site (https://wiki.egi.eu/wiki/SiteCertMan)
- Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)
- Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
- Howto register as a site, see first part of https://wiki.egi.eu/wiki/SiteCertMan
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites:
- Howto become an NGI manager (first time)
- Howto request help as a site administrator
- Howto respond to a request to act on an incident as a site administrator
- Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator
- Howto put a site into downtime for urgent matters as a ROD
- Howto report a problem (in GGUS) as a site administrator/as a user...
- Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
- Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD:
- Howto become an operations team member (first time: get a Grid certificate, register in the dteam VO, subscribe to mailing lists, register in GGUS, request GOCDB access, become familiar with the following manuals: ...)
- Howto work as a team member in 1st level support
- Howto handle new incidents as a 1st level support
- Howto work as ROD
- Howto hand over ROD shifts
- How does ROD interact with a site
Remarks:
- timeline for the whole documentation: before end of March (really?)
- dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard
- COD Operations manual: COD is a small group. Suggestion: the COD manual should be owned by COD people, i.e. its content is periodically reviewed by the COD partner.
- connection (linking) to other specific and well-defined documentation is welcome
- http://wiki.egi.eu/Operations
- http://wiki.egi.eu/Operations/Downtimes
- http://wiki.egi.eu/Operations/Downtimes/Declaring
- Howto declare a downtime
- Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator
- Howto put a site into downtime for urgent matters as a ROD
- http://wiki.egi.eu/Operations/Downtimes/Extending
- http://wiki.egi.eu/Operations/Downtimes/Cancelling
- http://wiki.egi.eu/Operations/Monitoring/
- Howto set a Nagios test as an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)
- http://wiki.egi.eu/Operations/Tickets
- http://wiki.egi.eu/Operations/Tickets/Creating
- Howto report a problem (in GGUS) as a site administrator/as a user...
- http://wiki.egi.eu/Operations/Tickets/Handling
- Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
- http://wiki.egi.eu/Operations/Incidents
- http://wiki.egi.eu/Operations/Incidents/Reporting
- Howto report security issues and incidents as a site administrator
- (procedure?) Is a vulnerability detection treated as an incident?
- http://wiki.egi.eu/Operations/Incidents/Handling
- Howto respond to incidents (sites and/or NGIs?)
- http://wiki.egi.eu/Operations/Sites
- http://wiki.egi.eu/Operations/Sites/General
- Howto become an operations team member (first time: get a Grid certificate, register in the dteam VO, subscribe to mailing lists, register in GGUS, request GOCDB access, become familiar with the following manuals: ...)
- Howto request help as a site administrator
- Howto respond to a request to act on an incident as a site administrator
- http://wiki.egi.eu/Operations/Sites/Registering
- Howto register a site (https://wiki.egi.eu/wiki/SiteCertMan)
- http://wiki.egi.eu/Operations/Sites/Certifying
- Howto certify a site (https://wiki.egi.eu/wiki/SiteCertMan)
- http://wiki.egi.eu/Operations/Sites/Suspending
- Howto suspend a site (isn't this rather a procedure?)
- http://wiki.egi.eu/Operations/Sites/Decommissioning
- http://wiki.egi.eu/Operations/NGIs
- http://wiki.egi.eu/Operations/NGIs/Creation
- Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
- http://wiki.egi.eu/Operations/NGIs/Manager
- Howto become an NGI manager (first time)
- http://wiki.egi.eu/Operations/NGIs/Tidyup
- Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
- http://wiki.egi.eu/Operations/ROD
- Howto work as a team member in 1st level support
- Howto handle new incidents as a 1st level support
- Howto work as ROD
- Howto hand over ROD shifts
- How does ROD interact with a site
- http://wiki.egi.eu/Operations/COD
- Howto contact COD
- Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)