
Difference between revisions of "User:Pslizik/Structure"

From EGIWiki
Line 1: Line 1:
== Operations ==
== Operations ==
* '''Best Practices'''
** Best Practices
** Best Practices/Proposed


* '''Training Guides'''
* '''Training Guides'''

Revision as of 13:56, 3 March 2011

Operations

  • Training Guides
    • Dashboard Howto
    • Tool Collection
      • Ops Portal
      • VO Dashboard
      • GGUS
      • GOCDB
      • Regional Tier 2 Tool (Peter: What's this?)
      • Nagios
      • Pakiti
      • Cacti
      • OSSEC
      • EGI Wiki (as a tool)
  • Operation Manuals
    • Procedures
      • COD Escalation Procedure
      • Creation and validation of a new Operations Centre
      • Operations Centre Decommission
      • How to validate a ROC or NGI Nagios box
      • Setting a Nagios test as an operations test
      • EGI CSIRT: Incident reporting
      • How to publish Site Information
      • HEP SPEC06
    • Procedures for NGIs and Sites
      • Getting started
      • Site administrator's duties
      • 1st-line support
      • ROD's duties
      • COD's duties (=> not our business, anyway)
      • GOCDB
        • Introducing a new site
        • Site downtime scheduling
        • Removing problematic sites
        • Removing unused sites
        • Removing resources
      • Intervention procedures
      • Incident reporting
    • Tools
      • GOCDB
      • Nagios
      • MyEGI
      • GIIS
      • GGUS

  • Howto declare a downtime
  • Howto contact COD
  • Howto suspend a site
  • Howto respond to incidents
  • Howto create an NGI (this one exists and has been approved by OMB: https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
  • How does ROD interact with a site
  • Howto open a ticket in GGUS


Andres':

https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites:

  • Howto become an NGI manager (first time)
  • Howto request help as a site administrator
  • Howto respond to a request to act on an incident as a site administrator
  • Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator
  • Howto put a site into downtime for urgent matters as a ROD
  • Howto report a problem (in GGUS) as a site administrator/as a user...
  • Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
  • Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?
  • https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD:
  • Howto become an operations team member (first time: get a Grid certificate, register in the dteam VO, subscribe to mailing lists, register in GGUS, request GOCDB access, become familiar with the following manuals: ...)
  • Howto work as a team member in 1st level support
  • Howto handle new incidents as a 1st level supporter
  • Howto work as ROD
  • Howto hand over ROD shifts
  • How does ROD interact with a site


Remarks:

  • timeline for the whole documentation: before end of March (really?)
  • the dashboard howto is to be put into the dashboard manual, so there is no need for separate procedures/howtos concerning the dashboard
  • COD Operations manual: COD is a small group. Suggestion: the COD manual should be owned by COD people, i.e. its content is periodically reviewed by the COD partner.
  • connection (linking) to other specific and well-defined documentation is welcome