CVMFS Task Force

From EGIWiki
Jump to: navigation, search
EGI Activity groups Special Interest groups Policy groups Virtual teams Distributed Competence Centres


Contents

Coordinator: Catalin Condurache/NGI_UK

Meetings page: Agendas

Mailing list: cvmfs-tf (at) mailman.egi.eu

Members:


Introduction

CVMFS (CERN Virtual Machine File System) has already proved to be a good solution for the distribution of application software across the resource centres, and several NGIs are offering CVMFS to national VOs.

During the April 2013 OMB in Manchester the idea of offering CVMFS as a service for all EGI VOs was discussed. It is about to have a CVMFS infrastructure that not only can be used within EGI, but also in collaboration with OSG for VOs that access resources hosted by both infrastructures.


NGI Status

NGI Status
NGI Ibergrid IBERGRID is already using CVMFS stratum 0 deployed at RAL as a service
AAROC We were planning to deploy a stratum 0 in South Africa for VO sagrid applications, as well as those in use in other sub-Saharan countries.

Update: We have finished the CI platform and are ready to include the Stratum 0 in the egi.eu distribution. Info at http://www.africa-grid.org/cvmfs and http://www.africa-grid.org/applications

NGI HR We are running CVMFS for our regional VO.
NGI FI We are one of the non-WLCG organizations running cvmfs.
OSG We recently implemented a CVMFS based service for the OSG. Our intended use case is for our virtual organizations and software group to distribute their content by this mechanism. Our design is quite simple, we allow a very few people write access and everyone read access. We are actively adding resources with access enabled, wide adoption being the goal.
NGI IT We have a very basic deployment for our catch-all VO, but it hasn't been very used until now.
NGI UK

We run stratum-0 repos (v2.0.15) for 5 VOs (mice, na62, hone, enmr.eu, phys.ibergrid.vo.eu). Our design allows installation jobs (run by VO_SGM) to write in an NFS area and then rsync to the CVMFS stratum-0 areas. For VOs not supported by GridPP UK (in terms of CPU allocations), we manually upload and maintain their CVMFS repos, but we are currently working to a web interface that will allow their own software management.

Also we run a CVMFS stratum-1 service (part of the LHC CVMFS infrastructure) which also includes replicas for the above mentioned VOs.

As plans, we look to migrate the stratum-0 nodes to v2.1.X on SL6. Also we work with CERN to replicate these 5 replicas on their stratum-1 service (3 out of 5 so far replicated). And looking to establish a network of stratum-1 servers to consolidate the non-LHC (or small) VOs cvmfs service infrastructure (tests to start soon with NIKHEF).

VO status

VO Status
VLEMED VO (1) used in some form to support production activities?

we were not until this email, but we asked around. The Dutch NGI is taking action already for non HEP users. we look forward to their findings.

(2) yes.

Johan Montagnat (1) No, we were not aware of this tool before.

(2) There is a clear need for easily deploying software packages grid-wise in the LS community. The existing solution of populating VO software space per site is often considered tedious, and leading to incoherent state when software cannot be installed on some sites due to installation job issues. So an alternative and global scale solution would be welcome. We would need to know a bit more on CVMFS though. From the slides shown, it is not clear how the software registered on the central CVMFS root directory is accessible from the sites / worker nodes in particular. (Is the CVMFS root mounted from all worker nodes? What is the performance impact of accessing to this remote software?).

Alexandre Bonvin 1. we are making use of it already. Very nice and simple. Only a few sites have that in place at this time
Jiri Chudoba - auger 1. The auger VO plans to use CVMFS. We have not yet built our own system of servers and we would like to

try "central" service.

Michael Kuss - glast.org  1. No

2. Yes

Comments to Catalin's questions/remarks in the 3-oct-2013 meeting:

Dimension of software area: 200k files in 15.4GB. These are 4 releases of the level-1 analysis package and 1 of the science analysis sw. 1GB each. The rest are common libraries, calibration db etc. The code is all rhel5 32bit compiled. This may change soon to rhel6 64bit. Hence, for a short transition period the amount may double.

Update frequency: right now 3-4 times a year, this may change to about once a month.

Sites: 8 active EGI sites. We also got enabled at two OSG sites. No sw installed (I was hoping for CVMFS).

Max file size: bummer! We have a few fits files up to 500MB, used to store the space-craft positions for long term simulations. Is 100MB a hard limit?

No overwrite: the analysis packages and libraries are versioned. I'm not sure about the calib db and other auxiliary files (like the huge fits files). What is the reasoning behind this design choice? What is about "delete"?

Objectives

Milestones

Actions

Initial working plan

It has been proposed following CVMFS-TF kick-off meeting (15 August 2013):

- Collect the expression of interest from VOs.

- Understand sites availability to install clients.

- Understand NGI availability to install regional squid.

- Mirroring between sites hosting stratus0 repositories. Creating a network of stratum1 servers.

Webinar - 5 September 2013

Aimed to provide needed information to NGIs/sites and user communities about the technical details and possible architecture.

The complete abstract for the presentation and registration details are available on INDICO at: https://indico.egi.eu/indico/conferenceDisplay.py?confId=1809

QA webinar chat window: https://wiki.egi.eu/wiki/File:CVMFS_webinar_QA_chat_window.doc

Recording - https://documents.egi.eu/public/ShowDocument?docid=1932

EGI Technical Forum - Madrid, 16 - 20 September 2013

A presentation on "CVMFS for EGI VOs" has been given during the User Community Board session (https://indico.egi.eu/indico/conferenceTimeTable.py?confId=1851#20130917) followed by Q&A and other discussions about creating the EGI CVMFS infrastructure.

Express of interest has been gathered from VOs representatives (biomed, auger, vlemed, glast.org) during the Technical Forum and steps of actions have been agreed.

CHEP 2013 - Amsterdam, 14 - 18 October 2013

A poster on "CernVM-FS – Beyond LHC Computing" has been presented (http://indico.cern.ch/getFile.py/access?contribId=392&sessionId=9&resId=0&materialId=poster&confId=214784) within the "Distributed Processing and Data Handling: Infrastructure, Sites, and Virtualization" track.

Express of interest in using CVMFS as s/w distribution mechanism has been gathered from cernatschool.org and t2k.org representatives.

Further actions have been agreed regarding Stratum-1 cross-replication between RAL Tier-1 and OSG sites.

Operations Management Board - 24 April 2014

A presentation on "CVMFS task force update" has been given during the April OMB meeting (https://indico.egi.eu/indico/materialDisplay.py?contribId=8&materialId=slides&confId=2162)

EGI Community Forum - Helsinki, 19 - 23 May 2014

A workshop "EGI services for global software and common data distribution" (https://indico.egi.eu/indico/sessionDisplay.py?sessionId=37&confId=1994#20140520) took place on Tuesday 20 May. Presentations on various aspects of deploying and using the CernVM-FS and Frontier technologies were delivered.

During the workshop, the necessity of a egi.eu CVMFS domain came into discussion. It was agreed to be hosted at RAL, but other stratum-0 sites are welcome to host *.egi.eu repositories (technical details TBC)

Also a hackathon on "Getting Started with the CernVM FileSystem or the Frontier Distributed Database Caching System" (https://indico.egi.eu/indico/contributionDisplay.py?sessionId=37&contribId=55&confId=1994) followed the workshop. Users were assisted by specialists with trying out the CernVM FileSystem and/or the Frontier Distributed Database Caching System with their own applications. Also specific site CVMFS problems were discussed and suggestions for fixing them given.

The poster "Software compatibility check framework for grid computing elements" (https://indico.egi.eu/indico/contributionDisplay.py?contribId=16&confId=2016) presented how usage of CernVM-FS overcame problems associated to the software distribution on the grid.

Following discussions with maintainers of EGI AppDB, it was agreed to create a CVMFS repository that will contain bits of software currently hosted by AppDB. It will be located under the new 'egi.eu' CVMFS domain.

New CVMFS 'egi.eu' domain active - September 2014

Work has been carried out and finalised on configuration of the new 'egi.eu' CVMFS domain. It is going to replace the 'gridpp.ac.uk' domain which accommodates the EGI VO repositories. All existing repositories (as /cvmfs/<repo_name>.gridpp.ac.uk) located at Stratum-0 (RAL) have been replicated as /cvmfs/<repo_name>.egi.eu and both domains are being replicated by Stratum-1s at RAL, NIKHEF, ASGC. Also TRIUMF is replicating the '*.egi.eu' repositories.

Few sites across the Grid have been contacted and agreed to manually configure the new domain, and tests proved successfully.

New cvmfs-keys v1.5 package available - 1 November 2014

A new cvmfs-keys v1.5-1 package has been made available (http://cernvm.cern.ch/portal/filesystem/cvmfs-keys-1.5). It mainly adds the public keys and Stratum-1 server addresses for the egi.eu and opensciencegrid.org CVMFS domains and its roll out will be of significant importance at sites supporting EGI VOs as it automatically configures the new 'egi.eu' domain. Therefore system administrators at sites are encouraged to install the package (https://cvmrepo.web.cern.ch/cvmrepo/yum/cvmfs/EL/5/x86_64/cvmfs-keys-1.5-1.noarch.rpm).

Results

EGI CVMFS Deployment Status
Site Stratum-0 Stratum-1 Squid Clients
RAL-LCG2

mice, na62, hone, phys.vo.ibergrid.eu, enmr.eu, glast.org, hyperk.org, t2k.org, cernatschool.org, biomed, snoplus.snolab.ca, auger, km3net.org, pheno - yes on stratum-0 v2.1 (on both gridpp.ac.uk and egi.eu domains)

comet.j-parc.jp - in progress

repositories replicated by dedicated EGI Stratum-1

vlemed, *.desy.de and oasis.opensciencegrid.org repos replicated by EGI Stratum-1

yes all VOs
Tier-2s UK as requested (mice, na62, hyperk.org, t2k.ork)
ZA_UJ enmr.eu
OSG oasis.opensciencegrid.org replicates the non-LHC (gridpp.ac.uk) repositories from RAL-LCG2 as requested, enmr, auger, geant4
CERN replicates mice, na62, hone, phys-ibergrid and wenmr (gridpp.ac.uk) repos from RAL-LCG2
NIKHEF vlemed on stratum-0 v2.1 replicates all *.gridpp.ac.uk and *.egi.eu repos from RAL-LCG2
DESY ilc, calice, hermes, hone, olympus, xfel, zeus on stratum-0 v2.1
ASGC replicates all *.gridpp.ac.uk and *.egi.eu repos from RAL-LCG2
TRIUMF replicates the entire 'egi.eu' domain from RAL-LCG2

Configurations

For the 'egi.eu' repositories (auger, biomed, cernatschool, glast, hyperk, km3net, mice, na62, pheno, phys-ibergrid, snoplus, t2k, wenmr) please install the latest cvmfs-keys v1.5-1 RPM (https://cvmrepo.web.cern.ch/cvmrepo/yum/cvmfs/EL/5/x86_64/cvmfs-keys-1.5-1.noarch.rpm). It practically automatically configures the 'egi.eu' domain by adding the public keys and Stratum-1 server addresses. It does the same things for the 'opensciencegrid.org' domain as well.

For other repositories see below.

Recommended Configurations at Replicas and Clients Level
VO Variables Values
ilc
CVMFS_STRATUM0
http://grid-cvmfs-null.desy.de:8000/cvmfs/ilc.desy.de
CVMFS_SERVER_URL
http://grid-cvmfs-one.desy.de:8000/cvmfs/ilc.desy.de
http://cvmfs-stratum-one.cern.ch/cvmfs/ilc.desy.de
calice
CVMFS_STRATUM0
http://grid-cvmfs-null.desy.de:8000/cvmfs/calice.desy.de
CVMFS_SERVER_URL
http://grid-cvmfs-one.desy.de:8000/cvmfs/calice.desy.de
http://cvmfs-stratum-one.cern.ch/cvmfs/calice.desy.de
hone
CVMFS_STRATUM0
http://grid-cvmfs-null.desy.de:8000/cvmfs/hone.desy.de
CVMFS_SERVER_URL
http://grid-cvmfs-one.desy.de:8000/cvmfs/hone.desy.de
http://cvmfs-stratum-one.cern.ch/cvmfs/hone.desy.de
hermes
CVMFS_STRATUM0
http://grid-cvmfs-null.desy.de:8000/cvmfs/hermes.desy.de
CVMFS_SERVER_URL
http://grid-cvmfs-one.desy.de:8000/cvmfs/hermes.desy.de
http://cvmfs-stratum-one.cern.ch/cvmfs/hermes.desy.de
vlemed
CVMFS_STRATUM0
http://mesthoop.nikhef.nl/cvmfs/vlemed.amc.nl
CVMFS_SERVER_URL
http://cvmfs01.nikhef.nl/cvmfs/vlemed.amc.nl
http://cvmfs-egi.gridpp.rl.ac.uk:8000/cvmfs/vlemed.amc.nl

Useful Links

CVMFS home page http://cernvm.cern.ch/portal/filesystem

CVMFS - Beyond LHC Computing https://indico.egi.eu/indico/getFile.py/access?contribId=7&resId=0&materialId=slides&confId=1235

RAL Tier1 CVMFS https://www.gridpp.ac.uk/wiki/RAL_Tier1_CVMFS

CVMFS for non LHC VOs https://www.gridpp.ac.uk/wiki/RALnonLHCCVMFS

Personal tools
Namespaces
Variants
Actions
Navigation
Toolbox
Print/export