Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @

Difference between revisions of "TSA2.5 Deployed Middleware Support Unit"

From EGIWiki
Jump to navigation Jump to search
Line 118: Line 118:
*Daniel Johansson (ARC)  
*Daniel Johansson (ARC)  
*Henrik Thostrup (SGAS, ARC)
*Henrik Thostrup (SGAS, ARC)
== Ticket handling procedure ==
== Ticket priorities ==
== Followup of tickets with 3rd line support units ==

== DMSU shifts  ==
== DMSU shifts  ==

Revision as of 13:43, 17 January 2012

TPM cheat sheet

TPM DMSU Cheat Sheet
dCache *
Gridsite *
LB *
MyProxy *
VOMS * *
WMS * *

People working for DMSU

The DMSU people are categorized into two groups:

  • Assigners, i.e. the prime technical contact person pr. partner and also the one to conduct the initial assignment of tickets following the TPM assignment
  • Resolvers, extra experts at each partners to which the assigner can delegate tasks when needed either due to technical reasons or due to overload of one of the other staff members.

The lost of people, with a short list of expertise areas relevant for DMSU follow here:


Assigner: Aleš Křenek, ljocha at ics dot muni dot cz
EGI SSO login: ljocha
Expertise: LB, DPM


  • Zdeněk Salvet, salvet at ics dot muni dot cz (LB, WMS, MyProxy)
  • Zdeněk Šustr, sustr4 at cesnet dot cz (LB, WMS)
  • Daniel Kouřil, kouril at ics dot muni dot cz (Gridsite, MyProxy, VOMS and security in general)


Assigners: Rebecca Breu (rbreu), Mathilde Romberg (romberg)

Expertise: UNICORE


  • Jason Milad Daivandy


Assigner: Alessandro Paolini (alessandro.paolini at
EGI SSO login: apaolini
Expertise: CREAM, LFC, VOMS


  • Andrea Cristofori (andrea.cristofori at,WMS)
  • David Rebatto (david.rebatto at
  • Sara Bertocco (sara.bertocco at (CREAM, WMS)
  • Sergio Traldi (sergio.traldi at, WMS)
  • Elisabetta Ronchieri (elisabetta.ronchieri at (STORM)
  • Stefano Dal Pra (stefano.dalpra at
  • Paolo Veronesi (paolo.veronesi at (BDII, FTS)


Assigner: Emmanouil Paisios
EGI SSO login: epaisios
Expertise: Globus


  • None (Emmanouil is responsible for everything here)


Assigner: Jens Larsson
EGI SSO login: jens
Expertise: ARC, dCache


  • Gerd Behrmann (dCache)
  • Daniel Johansson (ARC)
  • Henrik Thostrup (SGAS, ARC)

Ticket handling procedure

Ticket priorities

Followup of tickets with 3rd line support units

DMSU shifts

The main purpose of DMSU shift is no surprise: keep the things running, not to leave an important issue without fast reaction etc.

The shifts are held by groups of people with expertise on different middleware stacks. However, due to the prevailing gLite-related traffic in DMSU only gLite shifts are formally organized currently, the other stacks are handled on the best effort basis.

The specific duties of the person on shift are:

  • to follow incoming emails from GGUS, being able to react within approx. 2 hours in normal working hours
  • to identify "top priority" and "very urgent" issues, not only by the priority set by the submitter but also by using common sense, and to make sure an appropriate expert starts looking into the issue; this includes assigning the ticket to a specific person
  • to keep checking that there is reasonable response time, namely as a reaction to further submitter's correspondence; it should be almost immediate on "top priority", and we can probably afford upto 1 week for "less urgent"

One person holds the shift for one week, the duty is passed to the other on Monday afternoon.

Shift schedule

Oct 10 Aleš Křenek
Oct 17 Alessandro Paolini
Oct 24 Zdeněk Salvet
Oct 31 Alessandro Paolini
Nov 7 Aleš Křenek
Nov 14 Sergio Traldi
Nov 21 Zdeněk Salvet
Nov 28 Sara Bertocco
Dec 5 Zdeněk Salvet
Dec 12 INFN
Dec 19 Aleš Křenek
Dec 26 best effort
Jan 2 Aleš Křenek
Jan 9 Alessandro Paolini
Jan 16 Zdeněk Salvet

DMSU Digests

Brief description and indexing of issues solved within DMSU that are likely to have broader impact on EGI Operations.

Maintained on separate page Middleware_issues_and_solutions

Operations Documentation

DMSU contributes to maintenance of EGI Operations_Manuals, in particular

Systems available for DMSU

In order to debug issues and design workaround availability to some systems is needed. Below is a list of systems available for the DMSU staff pr partner.



  • # nodes/cores: 20/80
  • OS: SL 5.2
  • Batch system: PBSPro
  • Grid m/w: gLite 3.1
  • EA Site: Y


Virtual machines for experimental services, can be scratched and reinstalled as required.

  • # nodes/cores: 2/2
  • OS: SL 5.3 x86_64
  • Batch system: N/A
  • Scheduler: N/A
  • Grid m/w and flavour: LB 2.0 (of gLite 3.2)
  • EA Site: N



# nodes/cores: a) 4/8 b) 3/6
OS: a) SL4 x86_32 b) SL5 x86_64
Batch system: torque/pbs
Scheduler: maui
Grid m/w and flavour: a) INFNGRID 3.1 (based on gLite 3.1) b) INFNGRID 3.2 (based on gLite 3.2)
EA Site: Y (for STORM service)


Linux Cluster

   * Nodes/Cores: 938/5532
   * OS: SUSE Linux Enterprise Server 10
   * Batch System/Scheduler: SGE 6.2
   * Globus 4.0.7
   * EA Site: N



   * Nodes/Cores: 66/528
   * CentOS 5.7
   * Torque (to be replaced by SLURM)
   * MOAB (to be replaced by SLURM)
   * ARC 1.1
   * EA Site

Obsolete stuff

Not used anymore but keeping the old links here.

Relevant tickets

Relevant tickets passed through DMSU and assigned to other support units are gathered here