Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "User:Pslizik/Structure"

From EGIWiki
Jump to navigation Jump to search
m
 
(7 intermediate revisions by the same user not shown)
Line 1: Line 1:
== Operations ==
* '''Best Practices'''
** Best Practices
** Best Practices/Proposed
* '''Training Guides'''
** Dashboard Howto
** Tool Collection
*** Ops Portal
*** VO Dashboard
*** GGUS
*** GOCDB
*** Regional Tier 2 Tool ''(Peter: What's this?)''
*** Nagios
*** Pakiti
*** Cacti
*** OSSEC
*** EGI Wiki (as a tool)
* '''Operation Manuals'''
** Procedures
*** COD Escalation Procedure
*** Creation and validation of a new Operations Centre
*** Operations Centre Decomission
*** How to validate a ROC or NGI Nagios box
*** Setting Nagios test an operations test
*** EGI CSIRT: Incident reporting
*** How to publish Site Information
*** HEP SPEC06
** Procedures for NGIs and Sites
*** Getting started
*** Site administrator's duties
*** 1st-line support
*** ROD's duties
*** COD's duties (=> not our business, anyway)
*** GOCDB
**** Introducing a new site
**** Site downtime scheduling
**** Removing problematic sites
**** Removing unused sites
**** Removing resources
*** Intervention procedures
*** Incident reporting
** Tools
*** GOCDB
*** Nagios
*** MyEGI
*** GIIS
*** GGUS
Howto declare a downtime
Howto contact COD
Howto suspend a site
Howto respond to incidents
Howto create an NGI (this one exists and has been approved by OMB: https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
How does ROD interact with a site
Howto open a ticket in GGUS
----
'''Andres':'''
* Howto declare a downtime
* Howto contact COD
* Howto suspend a site (isn't this rather a procedure?)
* Howto respond to incidents
* Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
* Howto set Nagios test an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)
* Howto register and certify a site (https://wiki.egi.eu/wiki/SiteCertMan)
* Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)
* Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
* Howto register as a site, see first part of https://wiki.egi.eu/wiki/SiteCertMan
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites:
* Howto become an NGI manager (first time)
* Howto request for help as a site administrator
* Howto respond to a request to act on an incident as a site administrator
* Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator - Howto put a site into downtime for urgent matters as a ROD
* Howto report a problem (in GGUS) as a site administrator/as a user...
* Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
* Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?
* https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD:
* Howto become an operations team member (first time, get Grid Cert., register into dteam VO, subscribe to mailinglists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...)
* Howto work as a team member in 1st level support
* Howto handle new incidents as a 1st level support
* Howto work as ROD
* Howto handover ROD shifts
* How does ROD interact with a site
'''Remarks:'''
* timeline for the whole documentation: before end of March (really?)
* dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard
* COD Operations manual: COD is a small group. Suggestion: COD manual should be owned by COD people, ie. content is periodically reviewed by the COD partner.
* connection (linking) to other specific and well-defined documentation is welcomed
----
* '''http://wiki.egi.eu/Operations'''
* '''http://wiki.egi.eu/Operations'''
** http://wiki.egi.eu/Operations/Joining_the_team ''(Howto become an operations team member - first-timer: get Grid Cert., register into dteam VO, subscribe to mailinglists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...)''
** '''http://wiki.egi.eu/Operations/Joining_the_team''' ''Probably to be included in "Howto_become" a ROC, a COD, a 1st-liner...''
 
*** get Grid certificate
* '''http://wiki.egi.eu/Operations/Downtimes''' ''(Should this be a separate entity, or should it go under Operations/GOCDB/Downtimes?)''
*** register into dteam VO
** http://wiki.egi.eu/Operations/Downtimes/Declaring ''(both as a site administrator and as a ROD for urgent matters)''
*** subscribe to mailinglists
** http://wiki.egi.eu/Operations/Downtimes/Extending
*** register into GGUS
** http://wiki.egi.eu/Operations/Downtimes/Cancelling
*** request GOCDB access
 
<!-- ===================================================================================================================================== -->
* '''http://wiki.egi.eu/Operations/Monitoring/''' ''(Maybe Operation/Nagios would be better - more specific. Or maybe not.)''
** '''http://wiki.egi.eu/Operations/Downtimes''' ''(Should this be a separate entity, or should it go under Operations/GOCDB/Downtimes?)''
** http://wiki.egi.eu/Operations/Monitoring/Stuff ''(Howto set Nagios test an operations test - https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)''
*** http://wiki.egi.eu/Operations/Downtimes/Declaring ''(both as a site administrator and as a ROD for urgent matters)''
 
*** http://wiki.egi.eu/Operations/Downtimes/Extending
* '''http://wiki.egi.eu/Operations/Tickets''' ''(Should this be here or should it go onder Operations/GGUS/Tickes?)''
*** http://wiki.egi.eu/Operations/Downtimes/Cancelling
** http://wiki.egi.eu/Operations/Tickets/Creating ''(as a site administrator, as a user)''
<!-- ===================================================================================================================================== -->
** http://wiki.egi.eu/Operations/Tickets/Handling ''(as a 1st-line supporter, as ROD, as COD)''
** '''http://wiki.egi.eu/Operations/Monitoring/''' ''(Maybe Operation/Nagios would be better - more specific. Or maybe not.)''
 
*** http://wiki.egi.eu/Operations/Monitoring/Stuff ''(Howto set Nagios test an operations test - https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)''
* '''http://wiki.egi.eu/Operations/Incidents'''
<!-- ===================================================================================================================================== -->
** http://wiki.egi.eu/Operations/Incidents/Reporting
** '''http://wiki.egi.eu/Operations/Tickets''' ''(Should this be here or should it go onder Operations/GGUS/Tickes?)''
** http://wiki.egi.eu/Operations/Incidents/Handling
*** http://wiki.egi.eu/Operations/Tickets/Creating ''(as a site administrator, as a user)''
** http://wiki.egi.eu/Operations/Incidents/Responding_to (How to respond to incidents)
*** http://wiki.egi.eu/Operations/Tickets/Handling ''(as a 1st-line supporter, as ROD, as COD)''
 
<!-- ===================================================================================================================================== -->
* '''http://wiki.egi.eu/Operations/Sites'''
** '''http://wiki.egi.eu/Operations/Incidents'''
** http://wiki.egi.eu/Operations/Sites/Registering ''(https://wiki.egi.eu/wiki/SiteCertMan)''
*** http://wiki.egi.eu/Operations/Incidents/Reporting
** http://wiki.egi.eu/Operations/Sites/Certifying ''(https://wiki.egi.eu/wiki/SiteCertMan)''
*** http://wiki.egi.eu/Operations/Incidents/Handling
** http://wiki.egi.eu/Operations/Sites/Suspending
*** http://wiki.egi.eu/Operations/Incidents/Responding_to (How to respond to incidents - for site administrators)
** http://wiki.egi.eu/Operations/Sites/Decommissioning
<!-- ===================================================================================================================================== -->
 
** '''http://wiki.egi.eu/Operations/User'''
* '''http://wiki.egi.eu/Operations/Site_admin'''
*** http://wiki.egi.eu/Operations/User/Asking_for_help
** http://wiki.egi.eu/Operations/Site_admin/Become_one ''(Howto become an NGI manager - first time)''
*** http://wiki.egi.eu/Operations/User/Reporting_problems ''(GGUS)''
** http://wiki.egi.eu/Operations/Site_admin/Asking_for_help
<!-- ===================================================================================================================================== -->
** http://wiki.egi.eu/Operations/Site_admin/Responding_to_incidents
** '''http://wiki.egi.eu/Operations/Sites'''
 
*** http://wiki.egi.eu/Operations/Sites/Registering ''(https://wiki.egi.eu/wiki/SiteCertMan)''
* '''http://wiki.egi.eu/Operations/NGIs'''
*** http://wiki.egi.eu/Operations/Sites/Certifying ''(https://wiki.egi.eu/wiki/SiteCertMan)''
** http://wiki.egi.eu/Operations/NGIs/Creation (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
*** http://wiki.egi.eu/Operations/Sites/Suspending
** http://wiki.egi.eu/Operations/NGIs/Tidyup (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
*** http://wiki.egi.eu/Operations/Sites/Decommissioning
 
*** http://wiki.egi.eu/Operations/Sites/Retiring_a_component
* '''http://wiki.egi.eu/Operations/NGI_manager'''
<!-- ===================================================================================================================================== -->
** http://wiki.egi.eu/Operations/NGI_manager/Become_one ''(Howto become an NGI manager - first time)''
** '''http://wiki.egi.eu/Operations/Site_admin'''
 
*** http://wiki.egi.eu/Operations/Site_admin/Become_one ''(Howto become an NGI manager - first time)''
* '''http://wiki.egi.eu/Operations/1st-line_support'''
*** http://wiki.egi.eu/Operations/Site_admin/Duties ''(Any?)''
** http://wiki.egi.eu/Operations/1st-line_support/Becoming_a_1st-line_supporter
*** http://wiki.egi.eu/Operations/Site_admin/Asking_for_help
** http://wiki.egi.eu/Operations/1st-line_support/Duties ''(Howto work as a team member in 1st level support)''
*** http://wiki.egi.eu/Operations/Site_admin/Responding_to_incidents
** http://wiki.egi.eu/Operations/1st-line_support/Handling_incidents ''(Howto handle new incidents as a 1st level support)''
*** http://wiki.egi.eu/Operations/Site_admin/Setting_a_downtime ''(Should it be here or under Downtimes?)''
 
*** http://wiki.egi.eu/Operations/Site_admin/Reporting_problems ''(GGUS)''
* '''http://wiki.egi.eu/Operations/ROD'''
*** http://wiki.egi.eu/Operations/Site_admin/Reporting_security_incidents ''(Or file it rather under Issues?)''
** http://wiki.egi.eu/Operations/ROD/Becoming ''(How to become a ROD)''
*** http://wiki.egi.eu/Operations/Site_admin/Setting_a_downtime
** http://wiki.egi.eu/Operations/ROD/Duties ''(How to work as a ROD)''
*** http://wiki.egi.eu/Operations/Site_admin/Removing resources
** http://wiki.egi.eu/Operations/ROD/Handover ''(How to handover ROD shifts)''
<!-- ===================================================================================================================================== -->
** http://wiki.egi.eu/Operations/ROD/Interacting_with_sites
** '''http://wiki.egi.eu/Operations/NGIs'''
 
*** http://wiki.egi.eu/Operations/NGIs/Creation (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
* '''http://wiki.egi.eu/Operations/COD'''
*** http://wiki.egi.eu/Operations/NGIs/Decommissioning (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)*
** http://wiki.egi.eu/Operations/COD/Becoming ''(How to become a COD)''
<!-- ===================================================================================================================================== -->
** http://wiki.egi.eu/Operations/COD/Retiring_grid_components ''(Howto retire a Grid component - https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)''
** '''http://wiki.egi.eu/Operations/NGI_manager'''
*** http://wiki.egi.eu/Operations/NGI_manager/Become_one ''(Howto become an NGI manager - first time)''
<!-- ===================================================================================================================================== -->
** '''http://wiki.egi.eu/Operations/1st-line_support'''
*** http://wiki.egi.eu/Operations/1st-line_support/Becoming_a_1st-line_supporter
*** http://wiki.egi.eu/Operations/1st-line_support/Duties ''(Howto work as a team member in 1st level support)''
*** http://wiki.egi.eu/Operations/1st-line_support/Handling_incidents ''(Howto handle new incidents as a 1st level support)''
*** http://wiki.egi.eu/Operations/1st-line_support/Handling_tickets ''(Or put it rather under GGUS?)''
*** http://wiki.egi.eu/Operations/1st-line_support/Dashboard ''(a reference to the Dashboarch Manual)''
*** http://wiki.egi.eu/Operations/1st-line_support/Setting_a_downtime
<!-- ===================================================================================================================================== -->
** '''http://wiki.egi.eu/Operations/ROC'''
*** http://wiki.egi.eu/Operations/ROC/Decomissioning ''(obsolete?)''
<!-- ===================================================================================================================================== -->
** '''http://wiki.egi.eu/Operations/ROD'''
*** http://wiki.egi.eu/Operations/ROD/Becoming ''(How to become a ROD)''
*** http://wiki.egi.eu/Operations/ROD/Duties ''(How to work as a ROD)''
*** http://wiki.egi.eu/Operations/ROD/Handover ''(How to handover ROD shifts)''
*** http://wiki.egi.eu/Operations/ROD/Interacting_with_sites
*** http://wiki.egi.eu/Operations/ROD/Handling_tickets ''(Or put it rather under GGUS?)''
*** http://wiki.egi.eu/Operations/ROD/Howto_contact_COD
*** http://wiki.egi.eu/Operations/ROD/Putting_a_site_to_a_downtime_urgently ''(Should it be here or under Downtimes?)''
*** http://wiki.egi.eu/Operations/ROD/Dashboard ''(a reference to the Manual)''
*** http://wiki.egi.eu/Operations/ROD/Setting_a_downtime
*** http://wiki.egi.eu/Operations/ROD/Removing_problematic_sites
*** http://wiki.egi.eu/Operations/ROD/Removing unused sites
*** http://wiki.egi.eu/Operations/ROD/Removing resources
<!-- ===================================================================================================================================== -->
** '''http://wiki.egi.eu/Operations/COD''' ''(Obsolete now?)''
*** http://wiki.egi.eu/Operations/COD/Becoming ''(How to become a COD)''
*** http://wiki.egi.eu/Operatioins/COD/Duties''
*** http://wiki.egi.eu/Operations/COD/Retiring_grid_components ''(Howto retire a Grid component - https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)''
*** http://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD ''(What to do with this?)''
*** http://wiki.egi.eu/Operations/COD/Handling_tickets ''(Or put it rather under GGUS?)''
*** http://wiki.egi.eu/Operations/COD/Setting_a_downtime
<!-- ===================================================================================================================================== -->
** '''http://wiki.egi.eu/Operations/Tools'''
*** http://wiki.egi.eu/Operations/Tools/Dashboard ''(Dashboard Howto)''
*** http://wiki.egi.eu/Operations/Tools/GOCDB
*** http://wiki.egi.eu/Operations/Tools/Nagios
*** http://wiki.egi.eu/Operations/Tools/MyEGI
*** http://wiki.egi.eu/Operations/Tools/GIIS
*** http://wiki.egi.eu/Operations/Tools/GGUS
*** http://wiki.egi.eu/Operations/Tools/Pakiti
*** http://wiki.egi.eu/Operations/Tools/Cacti
*** http://wiki.egi.eu/Operations/Tools/EGI Wiki ''(Wiki as a Knowledge Base tool)''
<!-- ===================================================================================================================================== -->
** '''http://wiki.egi.eu/Operations/Others'''
*** http://wiki.egi.eu/Operations/Others/HEP_SPEC06

Latest revision as of 15:41, 4 March 2011