
Difference between revisions of "User:Pslizik/Structure"

== Operations ==
* '''http://wiki.egi.eu/Operations'''
 
** '''http://wiki.egi.eu/Operations/Joining_the_team''' ''Probably to be included in "Howto_become" a ROC, a COD, a 1st-liner...''
* '''Best Practices'''
*** get Grid certificate
** Best Practices
*** register into dteam VO
** Best Practices/Proposed
*** subscribe to mailing lists
 
*** register into GGUS
* '''Training Guides'''
*** request GOCDB access
** Dashboard Howto
<!-- ===================================================================================================================================== -->
** Tool Collection
** '''http://wiki.egi.eu/Operations/Downtimes''' ''(Should this be a separate entity, or should it go under Operations/GOCDB/Downtimes?)''
*** Ops Portal
*** http://wiki.egi.eu/Operations/Downtimes/Declaring ''(both as a site administrator and as a ROD for urgent matters)''
*** VO Dashboard
*** http://wiki.egi.eu/Operations/Downtimes/Extending
*** GGUS
*** http://wiki.egi.eu/Operations/Downtimes/Cancelling
*** GOCDB
<!-- ===================================================================================================================================== -->
*** Regional Tier 2 Tool ''(Peter: What's this?)''
** '''http://wiki.egi.eu/Operations/Monitoring/''' ''(Maybe Operations/Nagios would be better - more specific. Or maybe not.)''
*** Nagios
*** http://wiki.egi.eu/Operations/Monitoring/Stuff ''(Howto set a Nagios test as an operations test - https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)''
*** Pakiti
<!-- ===================================================================================================================================== -->
*** Cacti
** '''http://wiki.egi.eu/Operations/Tickets''' ''(Should this be here or should it go under Operations/GGUS/Tickets?)''
*** OSSEC
*** http://wiki.egi.eu/Operations/Tickets/Creating ''(as a site administrator, as a user)''
*** EGI Wiki (as a tool)
*** http://wiki.egi.eu/Operations/Tickets/Handling ''(as a 1st-line supporter, as ROD, as COD)''
 
<!-- ===================================================================================================================================== -->
* '''Operation Manuals'''
** '''http://wiki.egi.eu/Operations/Incidents'''
** Procedures
*** http://wiki.egi.eu/Operations/Incidents/Reporting
*** COD Escalation Procedure
*** http://wiki.egi.eu/Operations/Incidents/Handling
*** Creation and validation of a new Operations Centre
*** http://wiki.egi.eu/Operations/Incidents/Responding_to (How to respond to incidents - for site administrators)
*** Operations Centre Decommission
<!-- ===================================================================================================================================== -->
*** How to validate a ROC or NGI Nagios box
** '''http://wiki.egi.eu/Operations/User'''
*** Setting a Nagios test as an operations test
*** http://wiki.egi.eu/Operations/User/Asking_for_help
*** EGI CSIRT: Incident reporting
*** http://wiki.egi.eu/Operations/User/Reporting_problems ''(GGUS)''
*** How to publish Site Information
<!-- ===================================================================================================================================== -->
*** HEP SPEC06
** '''http://wiki.egi.eu/Operations/Sites'''
** Procedures for NGIs and Sites
*** http://wiki.egi.eu/Operations/Sites/Registering ''(https://wiki.egi.eu/wiki/SiteCertMan)''
*** Getting started
*** http://wiki.egi.eu/Operations/Sites/Certifying ''(https://wiki.egi.eu/wiki/SiteCertMan)''
*** Site administrator's duties
*** http://wiki.egi.eu/Operations/Sites/Suspending
*** 1st-line support
*** http://wiki.egi.eu/Operations/Sites/Decommissioning
*** ROD's duties
*** http://wiki.egi.eu/Operations/Sites/Retiring_a_component
*** COD's duties (=> not our business, anyway)
<!-- ===================================================================================================================================== -->
*** GOCDB
** '''http://wiki.egi.eu/Operations/Site_admin'''
**** Introducing a new site
*** http://wiki.egi.eu/Operations/Site_admin/Become_one ''(Howto become a site administrator - first time)''
**** Site downtime scheduling
*** http://wiki.egi.eu/Operations/Site_admin/Duties ''(Any?)''
**** Removing problematic sites
*** http://wiki.egi.eu/Operations/Site_admin/Asking_for_help
**** Removing unused sites
*** http://wiki.egi.eu/Operations/Site_admin/Responding_to_incidents
**** Removing resources
*** http://wiki.egi.eu/Operations/Site_admin/Setting_a_downtime ''(Should it be here or under Downtimes?)''
*** Intervention procedures
*** http://wiki.egi.eu/Operations/Site_admin/Reporting_problems ''(GGUS)''
*** Incident reporting
*** http://wiki.egi.eu/Operations/Site_admin/Reporting_security_incidents ''(Or file it rather under Issues?)''
** Tools
*** http://wiki.egi.eu/Operations/Site_admin/Setting_a_downtime
*** GOCDB
*** http://wiki.egi.eu/Operations/Site_admin/Removing_resources
*** Nagios
<!-- ===================================================================================================================================== -->
*** MyEGI
** '''http://wiki.egi.eu/Operations/NGIs'''
*** GIIS
*** http://wiki.egi.eu/Operations/NGIs/Creation (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
*** GGUS
*** http://wiki.egi.eu/Operations/NGIs/Decommissioning (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
 
<!-- ===================================================================================================================================== -->
Howto declare a downtime
** '''http://wiki.egi.eu/Operations/NGI_manager'''
Howto contact COD
*** http://wiki.egi.eu/Operations/NGI_manager/Become_one ''(Howto become an NGI manager - first time)''
Howto suspend a site
<!-- ===================================================================================================================================== -->
Howto respond to incidents
** '''http://wiki.egi.eu/Operations/1st-line_support'''
Howto create an NGI (this one exists and has been approved by OMB: https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
*** http://wiki.egi.eu/Operations/1st-line_support/Becoming_a_1st-line_supporter
How does ROD interact with a site
*** http://wiki.egi.eu/Operations/1st-line_support/Duties ''(Howto work as a team member in 1st level support)''
 
*** http://wiki.egi.eu/Operations/1st-line_support/Handling_incidents ''(Howto handle new incidents as a 1st level support)''
Howto open a ticket in GGUS
*** http://wiki.egi.eu/Operations/1st-line_support/Handling_tickets ''(Or put it rather under GGUS?)''
 
*** http://wiki.egi.eu/Operations/1st-line_support/Dashboard ''(a reference to the Dashboard Manual)''
----
*** http://wiki.egi.eu/Operations/1st-line_support/Setting_a_downtime
 
<!-- ===================================================================================================================================== -->
'''Andres':'''
** '''http://wiki.egi.eu/Operations/ROC'''
 
*** http://wiki.egi.eu/Operations/ROC/Decommissioning ''(obsolete?)''
* Howto declare a downtime
<!-- ===================================================================================================================================== -->
* Howto contact COD
** '''http://wiki.egi.eu/Operations/ROD'''
* Howto suspend a site (isn't this rather a procedure?)
*** http://wiki.egi.eu/Operations/ROD/Becoming ''(How to become a ROD)''
* Howto respond to incidents
*** http://wiki.egi.eu/Operations/ROD/Duties ''(How to work as a ROD)''
* Howto create an NGI (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
*** http://wiki.egi.eu/Operations/ROD/Handover ''(How to hand over ROD shifts)''
* Howto set a Nagios test as an operations test (https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)
*** http://wiki.egi.eu/Operations/ROD/Interacting_with_sites
* Howto register and certify a site (https://wiki.egi.eu/wiki/SiteCertMan)
*** http://wiki.egi.eu/Operations/ROD/Handling_tickets ''(Or put it rather under GGUS?)''
* Howto retire a Grid component (https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)
*** http://wiki.egi.eu/Operations/ROD/Howto_contact_COD
* Howto decommission an Operations Center (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
*** http://wiki.egi.eu/Operations/ROD/Putting_a_site_to_a_downtime_urgently ''(Should it be here or under Downtimes?)''
* Howto register as a site, see first part of https://wiki.egi.eu/wiki/SiteCertMan
*** http://wiki.egi.eu/Operations/ROD/Dashboard ''(a reference to the Manual)''
 
*** http://wiki.egi.eu/Operations/ROD/Setting_a_downtime
https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROCsAndSites:
*** http://wiki.egi.eu/Operations/ROD/Removing_problematic_sites
* Howto become an NGI manager (first time)
*** http://wiki.egi.eu/Operations/ROD/Removing_unused_sites
* Howto request help as a site administrator
*** http://wiki.egi.eu/Operations/ROD/Removing_resources
* Howto respond to a request to act on an incident as a site administrator
<!-- ===================================================================================================================================== -->
* Howto enter a downtime for a site (in GOCDB and/or in the dashboard) as a site administrator - Howto put a site into downtime for urgent matters as a ROD
** '''http://wiki.egi.eu/Operations/COD''' ''(Obsolete now?)''
* Howto report a problem (in GGUS) as a site administrator/as a user...
*** http://wiki.egi.eu/Operations/COD/Becoming ''(How to become a COD)''
* Howto handle tickets (in GGUS), as 1st level supporter, as ROD, as COD
*** http://wiki.egi.eu/Operations/COD/Duties
* Howto report security issues and incidents as a site administrator (procedure?) Is a vulnerability detection treated as an incident?
*** http://wiki.egi.eu/Operations/COD/Retiring_grid_components ''(Howto retire a Grid component - https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)''
 
*** http://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD ''(What to do with this?)''
* https://twiki.cern.ch/twiki/bin/view/EGEE/OperationalProceduresforROD:
*** http://wiki.egi.eu/Operations/COD/Handling_tickets ''(Or put it rather under GGUS?)''
* Howto become an operations team member (first time, get Grid Cert., register into dteam VO, subscribe to mailing lists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...)
*** http://wiki.egi.eu/Operations/COD/Setting_a_downtime
* Howto work as a team member in 1st level support
<!-- ===================================================================================================================================== -->
* Howto handle new incidents as a 1st level support
** '''http://wiki.egi.eu/Operations/Tools'''
* Howto work as ROD
*** http://wiki.egi.eu/Operations/Tools/Dashboard ''(Dashboard Howto)''
* Howto hand over ROD shifts
*** http://wiki.egi.eu/Operations/Tools/GOCDB
 
*** http://wiki.egi.eu/Operations/Tools/Nagios
* How does ROD interact with a site
*** http://wiki.egi.eu/Operations/Tools/MyEGI
 
*** http://wiki.egi.eu/Operations/Tools/GIIS
 
*** http://wiki.egi.eu/Operations/Tools/GGUS
'''Remarks:'''
*** http://wiki.egi.eu/Operations/Tools/Pakiti
* timeline for the whole documentation: before end of March (really?)
*** http://wiki.egi.eu/Operations/Tools/Cacti
* dashboard howto is to be put into the dashboard manual, so there's no need for having procedures/howtos concerning dashboard
*** http://wiki.egi.eu/Operations/Tools/EGI_Wiki ''(Wiki as a Knowledge Base tool)''
* COD Operations manual: COD is a small group. Suggestion: the COD manual should be owned by COD people, i.e. its content is periodically reviewed by the COD partner.
<!-- ===================================================================================================================================== -->
* connection (linking) to other specific and well-defined documentation is welcomed
** '''http://wiki.egi.eu/Operations/Others'''
 
*** http://wiki.egi.eu/Operations/Others/HEP_SPEC06
----
 
* http://wiki.egi.eu/Operations
** http://wiki.egi.eu/Operations/Joining_the_team ''(Howto become an operations team member - first-timer: get Grid Cert., register into dteam VO, subscribe to mailing lists, register into GGUS, request GOCDB access, become familiar with the following manuals: ...)''
 
* '''http://wiki.egi.eu/Operations/Downtimes''' ''(Should this be a separate entity, or should it go under Operations/GOCDB/Downtimes?)''
** http://wiki.egi.eu/Operations/Downtimes/Declaring ''(both as a site administrator and as a ROD for urgent matters)''
** http://wiki.egi.eu/Operations/Downtimes/Extending
** http://wiki.egi.eu/Operations/Downtimes/Cancelling
 
* http://wiki.egi.eu/Operations/Monitoring/ ''(Maybe Operations/Nagios would be better - more specific. Or maybe not.)''
** http://wiki.egi.eu/Operations/Monitoring/Stuff ''(Howto set a Nagios test as an operations test - https://wiki.egi.eu/wiki/Operations:Procedure_for_setting_Nagios_test_operational_for_COD)''
 
* '''http://wiki.egi.eu/Operations/Tickets''' ''(Should this be here or should it go under Operations/GGUS/Tickets?)''
** http://wiki.egi.eu/Operations/Tickets/Creating ''(as a site administrator, as a user)''
** http://wiki.egi.eu/Operations/Tickets/Handling ''(as a 1st-line supporter, as ROD, as COD)''
 
* '''http://wiki.egi.eu/Operations/Incidents'''
** http://wiki.egi.eu/Operations/Incidents/Reporting
** http://wiki.egi.eu/Operations/Incidents/Handling
** http://wiki.egi.eu/Operations/Incidents/Responding_to (How to respond to incidents)
 
* '''http://wiki.egi.eu/Operations/Sites'''
** http://wiki.egi.eu/Operations/Sites/Registering ''(https://wiki.egi.eu/wiki/SiteCertMan)''
** http://wiki.egi.eu/Operations/Sites/Certifying ''(https://wiki.egi.eu/wiki/SiteCertMan)''
** http://wiki.egi.eu/Operations/Sites/Suspending
** http://wiki.egi.eu/Operations/Sites/Decommissioning
 
* '''http://wiki.egi.eu/Operations/Site_admin'''
** http://wiki.egi.eu/Operations/Site_admin/Become_one ''(Howto become a site administrator - first time)''
** http://wiki.egi.eu/Operations/Site_admin/Asking_for_help
** http://wiki.egi.eu/Operations/Site_admin/Responding_to_incidents
 
* '''http://wiki.egi.eu/Operations/NGIs'''
** http://wiki.egi.eu/Operations/NGIs/Creation (https://wiki.egi.eu/wiki/Operations:NewNGIs_creation)
** http://wiki.egi.eu/Operations/NGIs/Tidyup (https://wiki.egi.eu/wiki/Operations:Operations_Centre_decommission)
 
* '''http://wiki.egi.eu/Operations/NGI_manager'''
** http://wiki.egi.eu/Operations/NGI_manager/Become_one ''(Howto become an NGI manager - first time)''
 
* '''http://wiki.egi.eu/Operations/1st-line_support'''
** http://wiki.egi.eu/Operations/1st-line_support/Becoming_a_1st-line_supporter
** http://wiki.egi.eu/Operations/1st-line_support/Duties ''(Howto work as a team member in 1st level support)''
** http://wiki.egi.eu/Operations/1st-line_support/Handling_incidents ''(Howto handle new incidents as a 1st level support)''
 
* '''http://wiki.egi.eu/Operations/ROD'''
** http://wiki.egi.eu/Operations/ROD/Becoming ''(How to become a ROD)''
** http://wiki.egi.eu/Operations/ROD/Duties ''(How to work as a ROD)''
** http://wiki.egi.eu/Operations/ROD/Handover ''(How to hand over ROD shifts)''
** http://wiki.egi.eu/Operations/ROD/Interacting_with_sites
 
* '''http://wiki.egi.eu/Operations/COD'''
** http://wiki.egi.eu/Operations/COD/Becoming ''(How to become a COD)''
** http://wiki.egi.eu/Operations/COD/Retiring_grid_components ''(Howto retire a Grid component - https://wiki.egi.eu/wiki/Operations:RetiringGridComponent)''

Latest revision as of 15:41, 4 March 2011