Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Agenda-20-06-2011"

From EGIWiki
Jump to navigation Jump to search
 
(11 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{Template:Op menubar}}  
{{Template:Op menubar}}  
[[Category:Grid Operations Meetings]]


''' WARNING: The meeting is being moved from 20th of June to 20th of June, because 13th of June is holiday in Netherlands and other countries. Sorry for the inconvenience.'''
'''WARNING: The meeting is being moved from 20th of June to 20th of June, because 13th of June is holiday in Netherlands and other countries. Sorry for the inconvenience.'''  


= Detailed agenda: Grid Operations Meeting 13 June 2011 14h00 Amsterdam time  =
= Detailed agenda: Grid Operations Meeting 20 June 2011 14h00 Amsterdam time  =


[http://evo.caltech.edu/evoNext/koala.jnlp?meeting=MeMMMu222BDiD89v9sDv9e EVO direct link] pwd:gridops || [https://www.egi.eu/indico/materialDisplay.py?materialId=2&confId=495 EVO details] || [https://www.egi.eu/indico/conferenceDisplay.py?confId=49 Indico page]
[http://evo.caltech.edu/evoNext/koala.jnlp?meeting=MeMMMu222BDiD89v9sDv9e EVO direct link] pwd:gridops || [https://www.egi.eu/indico/materialDisplay.py?materialId=2&confId=495 EVO details] || [https://www.egi.eu/indico/conferenceDisplay.py?confId=495 Indico page]  


== 1. Middleware releases and staged rollout  ==
== 1. Middleware releases and staged rollout  ==
=== 1.1 EMI-1 release status (Cristina)===
 
=== 1.1 EMI-1 release status (Cristina) ===
[https://www.egi.eu/indico/materialDisplay.py?contribId=0&materialId=0&confId=495 EMI release schedule (PDF)]<br>
Slides: [https://www.egi.eu/indico/getFile.py/access?contribId=0&resId=0&materialId=slides&confId=495 PDF][https://www.egi.eu/indico/getFile.py/access?contribId=0&resId=1&materialId=slides&confId=495 PDF]


=== 1.2. EMI/UMD current status  ===
=== 1.2. EMI/UMD current status  ===


==== 1.3. Staged Rollout (Mario)  ====
==== 1.3. Staged Rollout (Mario)  ====
===== 1.3.1 gLite 3.1 series<br>  =====
*LFC 1.8.0-1, patches in staged rollout state with no EA - Decision was to close those patches since there was no interest shown from production sites, and the support calendar has ended.<br>
===== 1.3.2 gLite 3.2 series  =====
*CREAM&nbsp;1.6.6 and glite-SGE_utils are under staged rollout - fix a bug seen in the previous staged rollout<br>
*L&amp;B 2.1.21:&nbsp;staged rollout done, report produced - possibly will be released together with cream and sge.
===== 1.3.3 EMI1 - UMD1<br>  =====
{| width="601" height="624" cellspacing="0" cellpadding="0" border="2" style="border-collapse: collapse;"
|-
! width="174" height="12" class="xl25" scope="col" | [[Product - sw-rel Ticket]]
! width="65" class="xl25" scope="col" | Verification
! width="84" class="xl25" scope="col" | Staged Rollout
! width="65" class="xl25" scope="col" | ET (Finish)
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2202 EMI.apel.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2232 EMI.arc-ce.sl5.x86_64]
| class="xl24" | OnGoing
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2234 EMI.arc-infosys.sl5.x86_64]
| class="xl24" | OnGoing
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2203 EMI.argus.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2204 EMI.bdii-site.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 23-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2205 EMI.bdii-top.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 23-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2207 EMI.cluster.sl5.x86_64]
| class="xl24" | OnGoing
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2206 EMI.cream.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2238 EMI.dcache.sl5.x86_64]
| class="xl24" | Not Started
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2212 EMI.dgas.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2213 EMI.dpm.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2214 EMI.glexec_wn.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2216 EMI.lb.sl5.x86_64]
| class="xl24" | OnGoing
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2217 EMI.lfc_mysql.sl5.x86_64]
| class="xl24" | OnGoing
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2218 EMI.lfc_oracle.sl5.x86_64]
| class="xl27" | onHOLD
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2208 EMI.lsf-utils.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2215 EMI.mpi.sl5.x86_64]
| class="xl28" | Rejected
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2237 EMI.proxyrenewal.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 23-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2210 EMI.torque-client.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2211 EMI.torque-server.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 23-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2209 EMI.torque-utils.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 23-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2222 EMI.ui.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2224 EMI.unicore-gateway.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2226 EMI.unicore-hila.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2229 EMI.unicore-registry.sl5.x86_64]
| class="xl28" | Rejected
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2231 EMI.unicore-tsi.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2228 EMI.unicore-uvos.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2230 EMI.unicore-ws.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2227 EMI.unicore-xuudb.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2219 EMI.voms_mysql.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 23-Jun
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2220 EMI.voms_oracle.sl5.x86_64]
| class="xl27" | onHOLD
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2221 EMI.wms.sl5.x86_64]
| class="xl28" | Rejected
| class="xl24" | &nbsp;
| class="xl24" | &nbsp;
|-
| width="174" height="12" class="xl26" | [https://rt.egi.eu/rt/Ticket/Display.html?id=2223 EMI.wn.sl5.x86_64]
| class="xl24" | DONE
| class="xl24" | OnGoing
| align="right" class="xl29" | 28-Jun
|}
<br>
*Requesting at OMB&nbsp;tomorrow, a delay of one week for the UMD&nbsp;official annoucement, from 4 July to '''11 July'''.
*Staged rollout tests with date of 23 June have been done or near finalization, reports already received, other expected until 28 June.
*L&amp;B&nbsp;and LFC_mysql '''verification''' reports are expected today and tomorrow, and will go to staged rollout next.
*MPI&nbsp;rejection:&nbsp;a new version has already been produced in EMI, and will be released soon.
*WMS:&nbsp;rejected due to several problems found in verification. One is fixed in the EMI&nbsp;update of last week, but there are still others that need to be fixed&nbsp;- proxyrenewal in particular. Discussing with EMI&nbsp;if we can have all this fixed soon so that it can still enter in the SW&nbsp;provisioning process and be released in UMD1.
*StoRM:&nbsp;will be released in an EMI&nbsp;update this week.
*UNICORE-Registry:&nbsp;rejected due to a technical mistake in the SW&nbsp;provisioning process (not due to a bug), will be released in the next update.


==== 1.4 Interoperability (Michaela)  ====
==== 1.4 Interoperability (Michaela)  ====


=== 2. Operational Issues  ===


=== 2. Operational Issues  ===
==== 2.1 Requirement #[https://rt.egi.eu/rt/Ticket/Display.html?id=1388 1388] ====
==== 2.1 Requirement #[https://rt.egi.eu/rt/Ticket/Display.html?id=1388 1388] ====
 
NGI_IT requested documentation about guidelines on how to migrate a service preserving its state (e.g. an SE), on how to set up hot-failover for stateful services.
NGI_IT requested documentation about guidelines on how to migrate a service preserving its state (e.g. an SE), on how to set up hot-failover for stateful services.  
*Instruction on what is needed to be copied and preserved.
 
*Instruction on what is needed to be copied and preserved.  
*Instruction on how to set up failover or redundant instances of the same service.
*Instruction on how to set up failover or redundant instances of the same service.


NGI_IT provided parsed EMI-1 products documentation and highlighted the components that are missing documentation on this issues.
NGI_IT provided parsed EMI-1 products documentation and highlighted the components that are missing documentation on this issues.
 
===== 2.1.1 Services that do not need documentation  =====
 
The following services already report in their documentation the procedures to migrate the service maintaining the state, and a description of an high availability configuration.  


===== 2.1.1 Services that do not need documentation =====
*AMGA  
The following services already report in their documentation the procedures to migrate the service maintaining the state, and a description of an high availability configuration.
*APEL publisher (''HA'' not docuemented but not critical)  
* AMGA
*dCache  
* APEL publisher (''HA'' not docuemented but not critical)
*VOMS  
* dCache
**VOMS-Oracle replica not documented  
* VOMS
*WMS  
** VOMS-Oracle replica not documented  
**Stateless service, documentation about critical log files to save/maintain  
* WMS
**Failover and load balancing available for both server/client side  
** Stateless service, documentation about critical log files to save/maintain
*LB  
** Failover and load balancing available for both server/client side
**HA not documented, but WMS can be configured to point to multiple LB
* LB
** HA not documented, but WMS can be configured to point to multiple LB


===== 2.1.2 Services that miss documentation =====
===== 2.1.2 Services that miss documentation =====


{| border="1"  
{| border="1"
|+ align="bottom" style="caption-side: bottom" | '''V''': documentation ok. '''?''': documentation existing but not complete. '''X''': missing documentation
|+ '''V''': documentation ok. '''?''': documentation existing but not complete. '''X''': missing documentation  
|-
|-
!Service  
! Service  
!How to migrate the service
! How to migrate the service  
!How to set up failover load balancing
! How to set up failover load balancing  
!Comments
! Comments
|-
|-
|LFC
| LFC  
|'''?'''
| '''?'''  
|'''?'''
| '''?'''  
| Documentation available for both HA and migration. But the docs may be not up-to-date. The load balancing tecnique based on DNS round robin is missing.
| Documentation available for both HA and migration. But the docs may be not up-to-date. The load balancing tecnique based on DNS round robin is missing.
|-
|-
|BDII
| BDII  
|'''V''' (not applicable)
| '''V''' (not applicable)  
|'''X'''
| '''X'''  
|Service stateless, no documentation available about HA and load balancing
| Service stateless, no documentation available about HA and load balancing
|-
|-
|DPM
| DPM  
|'''V'''
| '''V'''  
|'''?'''
| '''?'''  
|The DPM architecture is modular, but deployment scenarios are not well covered in the documentation
| The DPM architecture is modular, but deployment scenarios are not well covered in the documentation
|-
|-
|CREAM
| CREAM  
|'''?'''  
| '''?'''  
|'''X'''
| '''X'''  
|Cream is a stateless service, but for accounting and security purpose, some logs must be preserved to be compliant to EGI recommendation. It is not clear which files are important and which are not.
| Cream is a stateless service, but for accounting and security purpose, some logs must be preserved to be compliant to EGI recommendation. It is not clear which files are important and which are not.
|-
|-
|ARGUS
| ARGUS  
|'''X'''
| '''X'''  
|'''X'''
| '''X'''  
|  
| <br>
|}
|}


===== 2.1.3 Other middleware stacks? =====
===== 2.1.3 Other middleware stacks? =====
NGI_IT gathered information only about gLite components, where they have more experience. All the EMI services will be deployed from scratch. <br>
 
Are the same informations needed also for '''ARC''' components? The interested NGIs could parse the available documentation and report the component that miss information.
NGI_IT gathered information only about gLite components, where they have more experience. All the EMI services will be deployed from scratch. <br> Are the same informations needed also for '''ARC''' components? The interested NGIs could parse the available documentation and report the component that miss information. <br> UNICORE?


=== 3. AOB  ===
=== 3. AOB  ===


==== 3.1 ====
==== 3.1 ====
Next Meeting:
 
Next Meeting:  


[[Category:GridOpsMeeting]]
<br>

Latest revision as of 11:32, 15 October 2012

Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security

WARNING: The meeting is being moved from 20th of June to 20th of June, because 13th of June is holiday in Netherlands and other countries. Sorry for the inconvenience.

Detailed agenda: Grid Operations Meeting 20 June 2011 14h00 Amsterdam time

EVO direct link pwd:gridops || EVO details || Indico page

1. Middleware releases and staged rollout

1.1 EMI-1 release status (Cristina)

EMI release schedule (PDF)
Slides: PDFPDF

1.2. EMI/UMD current status

1.3. Staged Rollout (Mario)

1.3.1 gLite 3.1 series
  • LFC 1.8.0-1, patches in staged rollout state with no EA - Decision was to close those patches since there was no interest shown from production sites, and the support calendar has ended.
1.3.2 gLite 3.2 series
  • CREAM 1.6.6 and glite-SGE_utils are under staged rollout - fix a bug seen in the previous staged rollout
  • L&B 2.1.21: staged rollout done, report produced - possibly will be released together with cream and sge.
1.3.3 EMI1 - UMD1
Product - sw-rel Ticket Verification Staged Rollout ET (Finish)
EMI.apel.sl5.x86_64 DONE OnGoing 28-Jun
EMI.arc-ce.sl5.x86_64 OnGoing    
EMI.arc-infosys.sl5.x86_64 OnGoing    
EMI.argus.sl5.x86_64 DONE OnGoing 28-Jun
EMI.bdii-site.sl5.x86_64 DONE OnGoing 23-Jun
EMI.bdii-top.sl5.x86_64 DONE OnGoing 23-Jun
EMI.cluster.sl5.x86_64 OnGoing    
EMI.cream.sl5.x86_64 DONE OnGoing 28-Jun
EMI.dcache.sl5.x86_64 Not Started    
EMI.dgas.sl5.x86_64 DONE OnGoing 28-Jun
EMI.dpm.sl5.x86_64 DONE OnGoing 28-Jun
EMI.glexec_wn.sl5.x86_64 DONE OnGoing 28-Jun
EMI.lb.sl5.x86_64 OnGoing    
EMI.lfc_mysql.sl5.x86_64 OnGoing    
EMI.lfc_oracle.sl5.x86_64 onHOLD    
EMI.lsf-utils.sl5.x86_64 DONE OnGoing 28-Jun
EMI.mpi.sl5.x86_64 Rejected    
EMI.proxyrenewal.sl5.x86_64 DONE OnGoing 23-Jun
EMI.torque-client.sl5.x86_64 DONE OnGoing 28-Jun
EMI.torque-server.sl5.x86_64 DONE OnGoing 23-Jun
EMI.torque-utils.sl5.x86_64 DONE OnGoing 23-Jun
EMI.ui.sl5.x86_64 DONE OnGoing 28-Jun
EMI.unicore-gateway.sl5.x86_64 DONE OnGoing 28-Jun
EMI.unicore-hila.sl5.x86_64 DONE OnGoing 28-Jun
EMI.unicore-registry.sl5.x86_64 Rejected    
EMI.unicore-tsi.sl5.x86_64 DONE OnGoing 28-Jun
EMI.unicore-uvos.sl5.x86_64 DONE OnGoing 28-Jun
EMI.unicore-ws.sl5.x86_64 DONE OnGoing 28-Jun
EMI.unicore-xuudb.sl5.x86_64 DONE OnGoing 28-Jun
EMI.voms_mysql.sl5.x86_64 DONE OnGoing 23-Jun
EMI.voms_oracle.sl5.x86_64 onHOLD    
EMI.wms.sl5.x86_64 Rejected    
EMI.wn.sl5.x86_64 DONE OnGoing 28-Jun


  • Requesting at OMB tomorrow, a delay of one week for the UMD official annoucement, from 4 July to 11 July.
  • Staged rollout tests with date of 23 June have been done or near finalization, reports already received, other expected until 28 June.
  • L&B and LFC_mysql verification reports are expected today and tomorrow, and will go to staged rollout next.
  • MPI rejection: a new version has already been produced in EMI, and will be released soon.
  • WMS: rejected due to several problems found in verification. One is fixed in the EMI update of last week, but there are still others that need to be fixed - proxyrenewal in particular. Discussing with EMI if we can have all this fixed soon so that it can still enter in the SW provisioning process and be released in UMD1.
  • StoRM: will be released in an EMI update this week.
  • UNICORE-Registry: rejected due to a technical mistake in the SW provisioning process (not due to a bug), will be released in the next update.

1.4 Interoperability (Michaela)

2. Operational Issues

2.1 Requirement #1388

NGI_IT requested documentation about guidelines on how to migrate a service preserving its state (e.g. an SE), on how to set up hot-failover for stateful services.

  • Instruction on what is needed to be copied and preserved.
  • Instruction on how to set up failover or redundant instances of the same service.

NGI_IT provided parsed EMI-1 products documentation and highlighted the components that are missing documentation on this issues.

2.1.1 Services that do not need documentation

The following services already report in their documentation the procedures to migrate the service maintaining the state, and a description of an high availability configuration.

  • AMGA
  • APEL publisher (HA not docuemented but not critical)
  • dCache
  • VOMS
    • VOMS-Oracle replica not documented
  • WMS
    • Stateless service, documentation about critical log files to save/maintain
    • Failover and load balancing available for both server/client side
  • LB
    • HA not documented, but WMS can be configured to point to multiple LB
2.1.2 Services that miss documentation
V: documentation ok. ?: documentation existing but not complete. X: missing documentation
Service How to migrate the service How to set up failover load balancing Comments
LFC ? ? Documentation available for both HA and migration. But the docs may be not up-to-date. The load balancing tecnique based on DNS round robin is missing.
BDII V (not applicable) X Service stateless, no documentation available about HA and load balancing
DPM V ? The DPM architecture is modular, but deployment scenarios are not well covered in the documentation
CREAM ? X Cream is a stateless service, but for accounting and security purpose, some logs must be preserved to be compliant to EGI recommendation. It is not clear which files are important and which are not.
ARGUS X X
2.1.3 Other middleware stacks?

NGI_IT gathered information only about gLite components, where they have more experience. All the EMI services will be deployed from scratch.
Are the same informations needed also for ARC components? The interested NGIs could parse the available documentation and report the component that miss information.
UNICORE?

3. AOB

3.1

Next Meeting: