From EGIWiki
Jump to: navigation, search
Overview For users For resource providers Infrastructure status Site-specific configuration Architecture

Federated Cloud Communities menu: Home Production use cases Under development use cases Closed use cases High level tools use cases

General Information

  • Status: Pre-production
  • Start Date: 15/10/2014
  • End Date: -
  • EGI.eu contact: Diego Scardaci / diego.scardaci@egi.eu, Carlos Gimeno Yanez / cgimeno@bifi.es
  • External contact: Luis Villazon / luis.villazon.esteban@cern.ch, Laurence Field / Laurence.Field@cern.ch

Short Description

ATLAS is a particle physics experiment at the Large Hadron Collider at CERN that is searching for new discoveries in the head-on collisions of protons of extraordinarily high energy. ATLAS will learn about the basic forces that have shaped our Universe since the beginning of time and that will determine its fate. Among the possible unknowns are extra dimensions of space, unification of fundamental forces, and evidence for dark matter candidates in the Universe. Following the discovery of the Higgs boson, further data will allow in-depth investigation of the boson's properties and thereby of the origin of mass.

The ATLAS community would like to profit of the EGI Federated Cloud resources to absorb workload peaks in ATLAS grid.

Use Case

This use case foresees the usage of cloud infrastructure broker Vac/Vcycle developed by the University of Manchester. The CERN community has developed an OCCI connector for Vcycle to access the EGI Federated Cloud resources.

More information on Vac/Vcycle are available below.


Vac is a self managing system to control virtual machines which are running on hypervisors which are not managed by an IaaS system. It is an implementation of the vacuum model whereby a VM factory runs on each physical machine. Each factory independently decides to start a VM instance or instances if on a multi-core node. The factory takes care of the VM contextualization based upon the predetermined configuration for the VO. Currently, one instance is started per job and is automatically shut down when the job terminates are no further payloads are available. Information is exchanged between the host and the guest via a directory on the host which is mounted by the guest. One key piece of information shared is the exit status of the job. If the exist status is No work available, the factory will back-off from creating machines and try again later. An aspect of this approach is that as there is no central service and hence avoids a central point of failure and is horizontally scalable. Factories may communicate with each other to achieve target shares for the specific Vac space at the site. With this approach, each VM factory can decide which VO’s VMs to run, based on site-wide target shares and on a peer-to-peer protocol in which the site’s VM factories query each other to discover which VM types they are running, and therefore identify which VO’s VMs should be started as nodes become available again. For sites where most of the resources are dedicated to a few VOs, this approach provides a straight forward solution that is no longer dependent on all the Grid or cloud machinery for these jobs.


Vcycle is an alternative implementation of the vacuum model which can be used in conjunction with IasS providers. Whereas an instance of Vac resides on each physical host, a centralized Vcycle service uses the IaaS interface to manage the VM lifecycle following the same logic as implemented in Vac. It supervises the VMs and instantiates/shutdowns VMs depending on the load coming from the experiment’s central task queue. As with VAC, if the exist status is No work available, the factory will back-off from recreating machines and try again later. Using Vcycle this way can provide elastic capacity using the resource providers it has at its disposal.

Additional Files