Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "User:Pslizik/Structure"

From EGIWiki
Jump to navigation Jump to search
Line 60: Line 60:


---
---
'''Andres':'''


Howto declare a downtime
* Howto declare a downtime
Howto contact COD
* Howto contact COD
Howto suspend a site (isn't this rather a procedure?)
* Howto suspend a site (isn't this rather a procedure?)
Howto respond to incidents
* Howto respond to incidents
Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
* Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
Howto set Nagios test an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)
* Howto set Nagios test an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)
Howto register and certify a site (https://wiki.egi.eu/wiki/SiteCertMan)
* Howto register and certify a site (https://wiki.egi.eu/wiki/SiteCertMan)
Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)
* Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)
Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
* Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
Howto register as a site, see first part of https://wiki.egi.eu/wiki/SiteCertMan
* Howto register as a site, see first part of https://wiki.egi.eu/wiki/SiteCertMan


https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites:
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites:
Howto become an NGI manager (first time)
* Howto become an NGI manager (first time)
Howto request for help as a site administrator
* Howto request for help as a site administrator
Howto respond to a request to act on an incident as a site administrator
* Howto respond to a request to act on an incident as a site administrator
Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator - Howto put a site into downtime for urgent matters as a ROD
* Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator - Howto put a site into downtime for urgent matters as a ROD
Howto report a problem (in GGUS) as a site administrator/as a user...
* Howto report a problem (in GGUS) as a site administrator/as a user...
Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
* Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?
* Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?


https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD:
* https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD:
Howto become an operations team member (first time, get Grid Cert., register into dteam VO, subscribe to mailinglists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...)
* Howto become an operations team member (first time, get Grid Cert., register into dteam VO, subscribe to mailinglists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...)
Howto work as a team member in 1st level support
* Howto work as a team member in 1st level support
Howto handle new incidents as a 1st level support
* Howto handle new incidents as a 1st level support
Howto work as ROD
* Howto work as ROD
Howto handover ROD shifts
* Howto handover ROD shifts


How does ROD interact with a site
* How does ROD interact with a site




Remarks:
'''Remarks:'''
- timeline for the whole documentation: before end of March (really?)
* timeline for the whole documentation: before end of March (really?)
- dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard
* dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard
- COD Operations manual: COD is a small group. Suggestion: COD manual should be owned by COD people, ie. content is periodically reviewed by the COD partner.
* COD Operations manual: COD is a small group. Suggestion: COD manual should be owned by COD people, ie. content is periodically reviewed by the COD partner.
- connection (linking) to other specific and well-defined documentation is welcomed
* connection (linking) to other specific and well-defined documentation is welcomed

Revision as of 17:43, 7 February 2011

Operations

  • Best Practices
    • Best Practices
    • Best Practices/Proposed
  • Training Guides
    • Dashboard Howto
    • Tool Collection
      • Ops Portal
      • VO Dashboard
      • GGUS
      • GOCDB
      • Regional Tier 2 Tool (Peter: What's this?)
      • Nagios
      • Pakiti
      • Cacti
      • OSSEC
      • EGI Wiki (as a tool)
  • Operation Manuals
    • Procedures
      • COD Escalation Procedure
      • Creation and validation of a new Operations Centre
      • Operations Centre Decomission
      • How to validate a ROC or NGI Nagios box
      • Setting Nagios test an operations test
      • EGI CSIRT: Incident reporting
      • How to publish Site Information
      • HEP SPEC06
    • Procedures for NGIs and Sites
      • Getting started
      • Site administrator's duties
      • 1st-line support
      • ROD's duties
      • COD's duties (=> not our business, anyway)
      • GOCDB
        • Introducing a new site
        • Site downtime scheduling
        • Removing problematic sites
        • Removing unused sites
        • Removing resources
      • Intervention procedures
      • Incident reporting
    • Tools
      • GOCDB
      • Nagios
      • MyEGI
      • GIIS
      • GGUS

Howto declare a downtime Howto contact COD Howto suspend a site Howto respond to incidents Howto create an NGI (this one exists and has been approved by OMB: https://wiki.egi.eu/wiki/Operations:NewNGIs_creation) How does ROD interact with a site

Howto open a ticket in GGUS

--- Andres':

https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites:

  • Howto become an NGI manager (first time)
  • Howto request for help as a site administrator
  • Howto respond to a request to act on an incident as a site administrator
  • Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator - Howto put a site into downtime for urgent matters as a ROD
  • Howto report a problem (in GGUS) as a site administrator/as a user...
  • Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
  • Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?
  • https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD:
  • Howto become an operations team member (first time, get Grid Cert., register into dteam VO, subscribe to mailinglists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...)
  • Howto work as a team member in 1st level support
  • Howto handle new incidents as a 1st level support
  • Howto work as ROD
  • Howto handover ROD shifts
  • How does ROD interact with a site


Remarks:

  • timeline for the whole documentation: before end of March (really?)
  • dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard
  • COD Operations manual: COD is a small group. Suggestion: COD manual should be owned by COD people, ie. content is periodically reviewed by the COD partner.
  • connection (linking) to other specific and well-defined documentation is welcomed