Difference between revisions of "Competence centre DARIAH"

From EGIWiki

Revision as of 14:42, 23 July 2015



CC Coordinator: Karolj Skala, Davor Davidović
Technical Coordinator: Zoltán Farkas

CC members' list: cc-dariah AT mailman.egi.eu

Type of Competence Centre: Science-oriented

Target user communities: Digital Arts, Humanities, and Social Sciences

List of organizations representing the user communities: DARIAH-EU

Duration of the CC: 30 M

Starting at Project Month: 1 M

Ending at Project Month: 30 M

The DARIAH CC aims to widen the use of e-Infrastructures for Arts and Humanities research. The CC will develop and provide a workflow-based science gateway built on the general-purpose WS-PGRADE and gLibrary technologies, adapted to the needs of users from the Arts and Humanities. The gateway will provide access and compute services for data residing in distributed grid and cloud storage. The gateway will be validated and enriched with the ‘Multi-Source Distributed Real-Time Search and Information Retrieval’ application (SIR). The CC will engage with Arts and Humanities communities to attract more applications and users to the gateway.


  1. To strengthen the collaboration between DARIAH-EU and EGI by using workflow-oriented application gateways and deploying A&H applications in the EGI Federated Cloud (EGI FedCloud)
  2. To increase the number of e-Science services and applications accessible to A&H researchers, and to integrate existing NGI resources into EGI
  3. To raise A&H researchers' awareness of the possible benefits (research excellence) of using e-Infrastructure and e-Science technologies in their research, creating the conditions for sustained growth of the user community from A&H and from the social sciences
  4. To extend the work started within the DC-NET, INDICATE and DCH-RP projects to other A&H communities


  1. Ruđer Bošković Institute (RBI, Leading partner), Croatia
  2. Hungarian Academy of Sciences, Institute for Computer Science and Control (MTA SZTAKI), Hungary
  3. Italian National Institute of Nuclear Physics (INFN), Italy
  4. Gesellschaft für wissenschaftliche Datenverarbeitung mbH (GWDG), Germany
  5. Data Archiving and Networked Services (DANS), Netherlands
  6. Austrian Academy of Sciences (AAS), Austria


Task 1: User Support and Training

Duration: M6-M30, Leader: RBI

Objectives: Raise awareness among Arts and Humanities (A&H) researchers of the necessity of digital research, and qualify them to work with the EGI DARIAH CC services

  • to organize workshops and training courses
  • to provide ICT consulting services to the A&H community
  • to provide technical support services

Task 2: DARIAH eScience Gateway on EGI

Duration: M1-M24, Leader: SZTAKI

Task 3: Storing and Accessing DARIAH contents on EGI (SADE)

Duration: M1-M12, Leader: INFN

The overall goal of this mini-project is to create a digital repository of DARIAH contents using gLibrary, the framework developed by the Italian National Institute of Nuclear Physics (INFN) to create and manage archives of digital assets (data and metadata) on local, Grid and Cloud storage resources. The digital repository will be created taking into account the requirements of the DARIAH end-users.

For this specific mini-project the datasets will be provided by the Austrian Academy of Sciences (AAS), one of the leading Austrian research institutions, with long-standing experience and interest in the Arts and Humanities domain. The AAS datasets represent the work on a collection, more than 100 years old, on Bavarian dialects within the Austro-Hungarian Monarchy, from the beginnings of the German language to the present day. Several data types are taken into account: text, multimedia (images, audio files etc.) and URIs, as well as primary collection data, interpreted data, secondary background data and geo-data, under different licensing options.
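The mix of data types described above can be pictured with a small record structure. The following is only an illustrative sketch in Python: the class names, field names, identifiers and licence strings are all hypothetical, and it does not reflect the actual gLibrary data model or the AAS schema.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional


class MediaType(Enum):
    """Kinds of content named in the task description."""
    TEXT = "text"
    IMAGE = "image"
    AUDIO = "audio"
    URI = "uri"


class DataLayer(Enum):
    """Levels of data named in the task description."""
    PRIMARY = "primary collection data"
    INTERPRETED = "interpreted data"
    BACKGROUND = "secondary background data"
    GEO = "geo-data"


@dataclass
class Asset:
    """One repository entry (hypothetical layout, for illustration only)."""
    identifier: str
    media_type: MediaType
    layer: DataLayer
    licence: str                     # licences differ from item to item
    location: Optional[str] = None   # where the payload is stored, if any
    metadata: Dict[str, str] = field(default_factory=dict)


def group_by_licence(assets: List[Asset]) -> Dict[str, List[str]]:
    """Return asset identifiers grouped by licence string."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for asset in assets:
        groups[asset.licence].append(asset.identifier)
    return dict(groups)


# Made-up sample entries; identifiers and licence names are placeholders.
SAMPLE = [
    Asset("hw-0001", MediaType.TEXT, DataLayer.PRIMARY, "licence-A"),
    Asset("mm-0001", MediaType.AUDIO, DataLayer.PRIMARY, "licence-B"),
    Asset("loc-0001", MediaType.URI, DataLayer.GEO, "licence-A"),
]
```

Grouping by licence is shown because items under different licences would probably need different access rules when the repository is exposed to end users.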

An extract is available on the website of the "Database of Bavarian dialects in Austria electronically mapped" [1]. It comprises:

  • Headwords (about 50,000 A-Z)[2];
  • Records (about 40,000 plants; about 70,000 in general)[3];
  • Multimedia with Link to Audio-file (examples; to be improved)[4];
  • Multimedia with Collection (about 3,000; planned to be published within the mini-project)[5];
  • Multimedia connected to Headword (about 3,000; planned to be digitized)[6];
  • Project specific biographies[7];
  • Locations[8].

The AAS datasets will be orchestrated by the INFN gLibrary Digital Repository System whose high-level overview is shown in the following figure:


The repositories will be exposed to end users through two channels:

  1. As a (series of) portlet(s) integrated both in one of the already existing Science Gateways implemented with the Catania Science Gateway Framework [9] and in the Science Gateway developed by the lighthouse project;
  2. As native apps for mobile devices running the Android and iOS operating systems, downloadable from the official app stores. The mobile apps will be coded using a cross-platform development environment so that other mobile operating systems can be supported if needed. Furthermore, the apps could exploit the geo-localisation services available on smartphones and tablets to find content that is "near" the user.
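The idea of finding "near" contents from a device position can be sketched as a great-circle distance filter. This is a minimal illustration in Python, assuming each repository item carries a latitude and longitude. The `Item` class, the `nearby` function and the sample records are hypothetical and not part of gLibrary or the planned apps.

```python
import math
from dataclasses import dataclass
from typing import List

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


@dataclass(frozen=True)
class Item:
    """A repository item with a location (hypothetical structure)."""
    title: str
    lat: float  # degrees
    lon: float  # degrees


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance between two points, in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearby(items: List[Item], lat: float, lon: float, radius_km: float) -> List[Item]:
    """Return items within radius_km of (lat, lon), nearest first."""
    scored = [(haversine_km(lat, lon, it.lat, it.lon), it) for it in items]
    scored.sort(key=lambda pair: pair[0])
    return [it for dist, it in scored if dist <= radius_km]


# Made-up sample records placed at three Austrian cities.
SAMPLE = [
    Item("Innsbruck record", 47.2692, 11.4041),
    Item("Graz record", 47.0707, 15.4395),
    Item("Vienna record", 48.2082, 16.3738),
]

if __name__ == "__main__":
    # Query from central Vienna with a 200 km radius.
    for item in nearby(SAMPLE, 48.2082, 16.3738, 200.0):
        print(item.title)
```

A linear scan like this is adequate for a few thousand locations. A larger collection would more likely rely on a spatial index on the server side.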

Task 4: Multi-Source Distributed Real-Time Search and Information Retrieval (SIR)

Duration: M1-M12, Leader: GWDG

Task 5: Exploitation

Duration: M7-M30, Leader: DANS

Objectives: Ensure a successful transfer of the mini-projects' results to their target user community and researchers, and increase the applicability and impact of the mini-projects on the research conducted.

  • to increase the awareness of exploiting EGI infrastructure in the domain of A&H
  • to define the usage policy, increase applicability and impact of the mini-projects
  • to involve and integrate RI resources and services into EGI
  • to disseminate the mini-project results

DARIAH-EU aims to develop and maintain an eInfrastructure that supports A&H research practices. The EGI DARIAH CC will foster this aim by promoting the use of the EGI Grid and Cloud infrastructure to the DARIAH user community. Reaching the DARIAH user community and beyond requires a systematic plan that applies and integrates national roadmaps relying on different eInfrastructure and eScience technologies. In this regard, exploitation activities will include horizontal and vertical user support. Horizontal support will be based on disseminating knowledge and skills, and vertical support on establishing new technological solutions and environments. The mini-project activities will lay the groundwork for this vertical component, which will be used to design a roadmap for sustainable A&H community involvement in EGI.

Milestones (M) and Deliverables (D)

The following gives an overview of the scheduled deliverables and milestones. The 'Scope' of a deliverable defines its applicability and visibility: deliverables marked 'internal' are internal to the EGI DARIAH CC, while those marked 'external' are deliverables of the EGI-Engage project that are presentable to the EC.

Columns: Code, Title, Lead task, Lead participant, Type, Scope (Internal = not sent to the EC; External = sent to the EC), Delivery PM, Delivery CM, Delivered date, Status

  • User support and training plan
  • D1.2a: Report on dissemination and training activities
  • Report on dissemination and training activities
  • D2.1: Requirements collection (Lead task: T2, T3, T4; Lead participant: RBI; Type: R; Scope: Internal; Delivery PM: M04)
  • D2.2: Initial technical concept and integration plan (Type: R; Scope: Internal; Delivery PM: M03)
  • D2.3a: Mini-projects progress report (Type: R; Scope: Internal; Delivery PM: M18)
  • D2.3b: Mini-projects progress report (Type: R; Scope: Internal; Delivery PM: M30)
  • D5.1: Sustainability and user involvement plan (Type: R; Scope: Internal; Delivery PM: M12)
  • Data repository for DARIAH (Lead task/participant: SA2.6 DARIAH INFN; Scope: External; Delivery PM: M11)
  • Final version of Multi-Source Distributed Real-Time Search and Information Retrieval application
  • Production level gateway for Arts and Humanities
  • Prototype version of the EGI-enabled application
  • First version of the repository in place



Dissemination activities

  • Poster presentation at the 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), 25-29 May 2015, Opatija, Croatia
  • Poster and 2-minute oral presentation at the 12th ESWC conference (ESWC 2015), 31 May to 4 June 2015, Portorož, Slovenia

Useful links