Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Galaxy workflows with EC3"

From EGIWiki
Jump to navigation Jump to search
Line 12: Line 12:
* Install CLUE as elasticity manager
* Install CLUE as elasticity manager
* Install custom Galaxy tool (booties2)
* Install custom Galaxy tool (booties2)
* Download and register in Galaxy the Escherichia Coli Genome
== Virtual Elastic Cluster: components ==
The Elastic Cloud Computing Cluster (EC3) tool to create elastic virtual clusters on top of Infrastructure as a Service (IaaS) providers is composed by the following components:
* <b><u>[http://www.grycap.upv.es/im Infrastructure Manager (IM)]</u></b> is a tool that eases the access and the usability of IaaS clouds by automating the VMI selection, deployment, configuration, software installation, monitoring and update of Virtual Appliances. It supports APIs from a large number of virtual platforms, making user applications cloud-agnostic. In addition it integrates a contextualization system to enable the installation and configuration of all the user required applications providing the user with a fully functional infrastructure.
* <b><u>[http://imdocs.readthedocs.org/en/devel/radl.html Resource and Application Description Language (RADL)]</u></b> is a tool used to specify the requirements of the resources where the scientific applications will be executed. It must address not only hardware (CPU number, CPU architecture, RAM size, etc.) but also software requirements (applications, libraries, data base systems, etc.). It should include all the configuration details needed to get a fully functional and configured VM (a Virtual Appliance or VA). It merges the definitions of specifications, such as OVF, but using a declarative scheme, with contextualization languages such as Ansible. It also allows describing the underlying network capabilities required.
* <b><u>[http://www.grycap.upv.es/clues CLUE]</u></b> is an energy management system for High Performance Computing (HPC) Clusters and Cloud infrastructures. The main function of the system is to power off internal cluster nodes when they are not being used, and conversely to power them on when they are needed. CLUES system integrates with the cluster management middleware, such as a batch-queuing system or a cloud infrastructure management system, by means of different connectors

Revision as of 16:20, 6 October 2016

Overview For users For resource providers Infrastructure status Site-specific configuration Architecture




Galaxy workflows with EC3

Introduction

This guide is intended for researchers who want to use Galaxy, an open web-based platform for data-intensive research, in the cloud-based resources provided by the EGI Long Tail of Science Platform.

Objectives

In this guide we will show how to:

  • Deploy an elastic cluster of VMs in the RECAS-BARI resource of the EGI Federation
  • Install a Galaxy web-based platform for data intensive research with a Torque Server as backend
  • Install CLUE as elasticity manager
  • Install custom Galaxy tool (booties2)
  • Download and register in Galaxy the Escherichia Coli Genome

Virtual Elastic Cluster: components

The Elastic Cloud Computing Cluster (EC3) tool to create elastic virtual clusters on top of Infrastructure as a Service (IaaS) providers is composed by the following components:

  • Infrastructure Manager (IM) is a tool that eases the access and the usability of IaaS clouds by automating the VMI selection, deployment, configuration, software installation, monitoring and update of Virtual Appliances. It supports APIs from a large number of virtual platforms, making user applications cloud-agnostic. In addition it integrates a contextualization system to enable the installation and configuration of all the user required applications providing the user with a fully functional infrastructure.
  • Resource and Application Description Language (RADL) is a tool used to specify the requirements of the resources where the scientific applications will be executed. It must address not only hardware (CPU number, CPU architecture, RAM size, etc.) but also software requirements (applications, libraries, data base systems, etc.). It should include all the configuration details needed to get a fully functional and configured VM (a Virtual Appliance or VA). It merges the definitions of specifications, such as OVF, but using a declarative scheme, with contextualization languages such as Ansible. It also allows describing the underlying network capabilities required.
  • CLUE is an energy management system for High Performance Computing (HPC) Clusters and Cloud infrastructures. The main function of the system is to power off internal cluster nodes when they are not being used, and conversely to power them on when they are needed. CLUES system integrates with the cluster management middleware, such as a batch-queuing system or a cloud infrastructure management system, by means of different connectors