Difference between revisions of "User:Pslizik/Structure"
Line 49: | Line 49: | ||
*** GIIS | *** GIIS | ||
*** GGUS | *** GGUS | ||
Howto declare a downtime | |||
Howto contact COD | |||
Howto suspend a site | |||
Howto respond to incidents | |||
Howto create an NGI (this one exists and has been approved by OMB: https://wiki.egi.eu/wiki/Operations:NewNGIs_creation) | |||
How does ROD interact with a site | |||
Howto open a ticket in GGUS | |||
--- | |||
Howto declare a downtime | |||
Howto contact COD | |||
Howto suspend a site (isn't this rather a procedure?) | |||
Howto respond to incidents | |||
Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation) | |||
Howto set Nagios test an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD) | |||
Howto register and certify a site (https://wiki.egi.eu/wiki/SiteCertMan) | |||
Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent) | |||
Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission) | |||
Howto register as a site, see first part of https://wiki.egi.eu/wiki/SiteCertMan | |||
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites: | |||
Howto become an NGI manager (first time) | |||
Howto request for help as a site administrator | |||
Howto respond to a request to act on an incident as a site administrator | |||
Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator - Howto put a site into downtime for urgent matters as a ROD | |||
Howto report a problem (in GGUS) as a site administrator/as a user... | |||
Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD | |||
Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident? | |||
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD: | |||
Howto become an operations team member (first time, get Grid Cert., register into dteam VO, subscribe to mailinglists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...) | |||
Howto work as a team member in 1st level support | |||
Howto handle new incidents as a 1st level support | |||
Howto work as ROD | |||
Howto handover ROD shifts | |||
How does ROD interact with a site | |||
Remarks: | |||
- timeline for the whole documentation: before end of March (really?) | |||
- dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard | |||
- COD Operations manual: COD is a small group. Suggestion: COD manual should be owned by COD people, ie. content is periodically reviewed by the COD partner. | |||
- connection (linking) to other specific and well-defined documentation is welcomed |
Revision as of 17:40, 7 February 2011
Operations
- Best Practices
- Best Practices
- Best Practices/Proposed
- Training Guides
- Dashboard Howto
- Tool Collection
- Ops Portal
- VO Dashboard
- GGUS
- GOCDB
- Regional Tier 2 Tool (Peter: What's this?)
- Nagios
- Pakiti
- Cacti
- OSSEC
- EGI Wiki (as a tool)
- Operation Manuals
- Procedures
- COD Escalation Procedure
- Creation and validation of a new Operations Centre
- Operations Centre Decomission
- How to validate a ROC or NGI Nagios box
- Setting Nagios test an operations test
- EGI CSIRT: Incident reporting
- How to publish Site Information
- HEP SPEC06
- Procedures for NGIs and Sites
- Getting started
- Site administrator's duties
- 1st-line support
- ROD's duties
- COD's duties (=> not our business, anyway)
- GOCDB
- Introducing a new site
- Site downtime scheduling
- Removing problematic sites
- Removing unused sites
- Removing resources
- Intervention procedures
- Incident reporting
- Tools
- GOCDB
- Nagios
- MyEGI
- GIIS
- GGUS
- Procedures
Howto declare a downtime Howto contact COD Howto suspend a site Howto respond to incidents Howto create an NGI (this one exists and has been approved by OMB: https://wiki.egi.eu/wiki/Operations:NewNGIs_creation) How does ROD interact with a site
Howto open a ticket in GGUS
---
Howto declare a downtime Howto contact COD Howto suspend a site (isn't this rather a procedure?) Howto respond to incidents Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation) Howto set Nagios test an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD) Howto register and certify a site (https://wiki.egi.eu/wiki/SiteCertMan) Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent) Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission) Howto register as a site, see first part of https://wiki.egi.eu/wiki/SiteCertMan
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites: Howto become an NGI manager (first time) Howto request for help as a site administrator Howto respond to a request to act on an incident as a site administrator Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator - Howto put a site into downtime for urgent matters as a ROD Howto report a problem (in GGUS) as a site administrator/as a user... Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD: Howto become an operations team member (first time, get Grid Cert., register into dteam VO, subscribe to mailinglists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...) Howto work as a team member in 1st level support Howto handle new incidents as a 1st level support Howto work as ROD Howto handover ROD shifts
How does ROD interact with a site
Remarks:
- timeline for the whole documentation: before end of March (really?)
- dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard
- COD Operations manual: COD is a small group. Suggestion: COD manual should be owned by COD people, ie. content is periodically reviewed by the COD partner.
- connection (linking) to other specific and well-defined documentation is welcomed