Difference between revisions of "GPGPU-FedCloud"

From EGIWiki
Jump to: navigation, search
(Participants)
(Current status)
Line 27: Line 27:
 
  Middleware: Openstack Kilo
 
  Middleware: Openstack Kilo
  
GOCDB: IISAS-GPUCloud, https://goc.egi.eu/portal/index.php?Page_Type=Site&id=1485
+
EGI federated cloud
Openstack endpoint: https://keystone3.ui.savba.sk:5000/v2.0
+
GOCDB: IISAS-GPUCloud, https://goc.egi.eu/portal/index.php?Page_Type=Site&id=1485
OCCI endpoint: https://nova3.ui.savba.sk:8787
+
Openstack endpoint: https://keystone3.ui.savba.sk:5000/v2.0
Supported VOs: fedcloud.egi.eu, ops, dteam
+
OCCI endpoint: https://nova3.ui.savba.sk:8787
 
+
Supported VOs: fedcloud.egi.eu, ops, dteam
  
 
= Progress =
 
= Progress =

Revision as of 14:32, 8 October 2015

EGI-Engage project: Main page WP1(NA1) WP3(JRA1) WP5(SA1) PMB Deliverables and Milestones Quality Plan Risk Plan Data Plan
Roles and
responsibilities
WP2(NA2) WP4(JRA2) WP6(SA2) AMB Software and services Metrics Project Office Procedures



Objective

To provide support for accelerated computing in EGI-Engage federated cloud.


Participants

Viet Tran (IISAS)

Jan Astalos (IISAS)

Miroslav Dobrucky (IISAS)

Current status

A working site with GPGPU in EGI federated cloud

HW configuration:

IBM dx360 M4 server with two NVIDIA Tesla K20 accelerators.
Ubuntu 14.04.2 LTS with KVM/QEMU, PCI passthrough virtualization of GPU cards.

SW configuration:

Base OS: Ubuntu 14.04.2 LTS
Hypervisor: KVM
Middleware: Openstack Kilo

EGI federated cloud

GOCDB: IISAS-GPUCloud, https://goc.egi.eu/portal/index.php?Page_Type=Site&id=1485
Openstack endpoint: https://keystone3.ui.savba.sk:5000/v2.0
OCCI endpoint: https://nova3.ui.savba.sk:8787
Supported VOs: fedcloud.egi.eu, ops, dteam

Progress

  • May 2015
    • Review of available technologies
    • GPGPU virtualisation in KVM/QEMU
    • Performance testing of passthrough
HW configuration: 
IBM dx360 M4 server with two NVIDIA Tesla K20 accelerators.
Ubuntu 14.04.2 LTS with KVM/QEMU, PCI passthrough virtualization of GPU cards.
Tested application:
NAMD molecular dynamics simulation (CUDA version), STMV test example (http://www.ks.uiuc.edu/Research/namd/).
Performance results:
Tested application runs 2-3% slower in virtual machine compared to direct run on tested server.
If hyperthreading is enabled on compute server, vCPUs have to be pinned to real cores so that
whole cores will be dedicated to one VM. To avoid potential performance problems, hyperthreading 
should be switched off.

  • June 2015
    • Creating cloud site with GPGPU support
Configuration: master node, 2 worker nodes (IBM dx360 M4 servers, see above)
Base OS: Ubuntu 14.04.2 LTS
Hypervisor: KVM
Middleware: Openstack Kilo
  • July 2015
    • Creating cloud site with GPGPU support
Cloud site created at keystone3.ui.savba.sk, master + two worker nodes, configuration reported above
Creating VM images for GPGPU (based on Ubuntu 14.04, GPU driver and libraries)
  • August 2015
    • Testing cloud site with GPGPU support
Performance testing and tuning with GPGPU in Openstack 
 - comparing performance of cloud-based VM with non-cloud virtualization and physical machine, finding discrepancies and tuning them
 - setting CPU flavor in Openstack nova (performance optimization) 
 - Adjusting Openstack scheduler
Starting process of integration of the site to EGI FedCloud
 - Keystone VOMS support being integrated
 - OCCI in preparation, installation planned in September
  • September 2015
 Continue integration to EGI-FedCloud
  • Next steps
 Full integration, certification and production support

Back to Accelerated Computing task