Difference between revisions of "EGI ENVRI"
Line 16: | Line 16: | ||
*'''Leader:''' Malgorzata Krakowian (malgorzata.krakowian @ egi.eu) <br> | *'''Leader:''' Malgorzata Krakowian (malgorzata.krakowian @ egi.eu) <br> | ||
*'''Start Date''': 1.02.2013<br> | *'''Start Date''': 1.02.2013<br> | ||
*'''Meetings:''' [https://indico.egi.eu/indico/categoryDisplay.py?categId=106 Indico page] (slides and minutes from meetings) | *'''Meetings:''' [https://indico.egi.eu/indico/categoryDisplay.py?categId=106 Indico page] (slides and minutes from meetings) | ||
*[http://envri.eu/eiscat_3d-study-case '''Official webpage'''] | *[http://envri.eu/eiscat_3d-study-case '''Official webpage'''] | ||
<br> | <br> | ||
Line 26: | Line 27: | ||
== <span class="toctext">Motivation</span><br> == | == <span class="toctext">Motivation</span><br> == | ||
A study case was set up to identify existing services and solutions from EGI that could address the data pre-processing, post-processing, and publishing needs of these two ESFRI projects. The outcome of the pilot is expected to be directly applicable to EISCAT_3D | A study case was set up to identify existing services and solutions from EGI that could address the data pre-processing, post-processing, and publishing needs of these two ESFRI projects. The outcome of the pilot is expected to be directly applicable to EISCAT_3D, and indirectly applicable to other ESFRIs of ENVRI. In cooperation with EISCAT-3D representatives in ENVRI, EGI.eu will try to find the most suitable solutions for pre-processing primary data and post-processing it for publication. <br> | ||
== <span class="mw-headline" id=" | == <span class="mw-headline" id="Progress">Outcome</span> == | ||
The design of the next-generation incoherent scatter radar system, EISCAT_3D, opens up opportunities for physicists to explore many new research fields. It also introduces significant challenges in handling large-scale experimental data, which will be generated at high speed and in large volumes. This challenge is typically referred to as a big data problem and requires solutions beyond the capabilities of conventional database technologies. To identify existing and new services that can tackle the EISCAT_3D big data challenge, a collaboration was formed in February 2013 among EISCAT_3D, EGI and the EUDAT infrastructures under the ENVRI project. | |||
[[Image:EISCAT-3d.png|500px|EISCAT-3d.png]] | |||
=== Phase 1<br> === | |||
A [https://documents.egi.eu/secure/ShowDocument?docid=1839 'Towards a Big Data Strategy for EISCAT-3D' document] is emerging from the collaboration and it outlines a project that would take the first steps towards defining the EISCAT_3D big data strategy. | |||
'' | *[https://wiki.egi.eu/w/images/e/e7/ENVRI-EISCAT3D.pdf 'Towards a Big Data Strategy for EISCAT-3D' presentation] (pdf) during [http://eiscat2013.lancs.ac.uk/ 16th EISCAT International Symposium 2013 ] | ||
=== Phase 2<br> === | |||
The following questionnaires were used to collect requirements from EISCAT data managers and scientists: | |||
For scientists: https://www.surveymonkey.com/s/ENVRI-EISCAT_Scientists | |||
For data managers: https://www.surveymonkey.com/s/ENVRI-EISCAT_Data_managers | |||
Responses: https://documents.egi.eu/public/ShowDocument?docid=1983 | |||
=== Phase 3 <br> === | |||
Development and pilot deployment of [http://sourceforge.net/projects/osgcat/ OSGC - OpenSource Geospatial Catalogue] | |||
<span class="beta" id="dev-status" /> | |||
OSGC is an Open Source implementation of an OpenSearch GeoSpatial Catalogue compliant with the OGC 10-32r3 specification, developed by EGI.eu ([http://www.egi.eu/ http://www.egi.eu/]) under the ENVRI ([http://envri.eu/ http://envri.eu/]) project.<br> <br> OSGC provides a catalogue engine built on top of a PostgreSQL+PostGIS database, which exposes a customizable OpenSearch interface. Most of the application configuration can be set from the Admin web interface, while Data Administrators have a separate Dropbox interface, which eases the management of the catalog and the data storage, and a Data Gateway interface, which controls access to data and produces data access statistics.<br> <br> A stand-alone client web interface, written in HTML5 and JavaScript, can be used to query this catalog and other compliant OpenSearch catalogs. | |||
<header> | |||
'''Features''' | |||
</header> | |||
*OpenSearch catalogue engine with customizable output formats, products metadata, query schema, input formats (for ingestion). | |||
*Web admin interface (to offer the catalog as a Platform-As-A-Service on the Cloud). | |||
*Dropbox, to automatically extract metadata, register it into the catalogue and optionally push the data file into Cloud or other connected storage. | |||
*Data Gateway interface, to control access to data, produce data access statistics and bridge non-HTTP protocols. | |||
*OpenSearch web client interface, which can be run remotely or as a standalone application (for integration into Cloud Virtual Laboratories PaaS services) and supports cumulative download (with shopping-cart functionality). | |||
== <span id="Members" class="mw-headline">Members </span> == | |||
'''[http://egi.eu EGI.eu]'''<br> | |||
*(Study case leader & EGI Operations Officer) Malgorzata Krakowian malgorzata.krakowian [at] egi.eu | |||
*(Technical Outreach Manager) Gergely Sipos gergely.sipos [at] egi.eu | |||
*(User Community Support Officer) Nuno Ferreira nuno.ferreir [at] <span>egi</span>.eu<br> | |||
*(EGI Chief Operations Officer) Tiziana Ferrari | |||
'''[https://www.eiscat3d.se/node EISCAT-3D:]'''<br> | |||
* | *Dr. Ingemar Haggstrom Ingemar.Haggstrom [at] eiscat.se<br> | ||
<br> | |||
'''[http://envri.eu/ ENVRI:]''' | |||
*Yin Chen ChenY58 [at] cardiff.ac.uk<br> | |||
*Malcolm Atkinson | |||
*Alex Hardisty | |||
*Yannick Legré | |||
*Paul Martin | |||
*Alun Preece | |||
'''[http://eudat.eu/ EUDAT]/[http://www.csc.com/ CSC:]''' | |||
*Ari Lukkarinen ari.lukkarinen [at] csc.fi | |||
*Antti Pursula | |||
*Ville Savolainen | |||
== <span class="mw-headline" id="Resources">Resources </span><br> == | |||
= | *[https://drive.google.com/a/egi.eu/?pli=1#folders/0Bzrt5PnQFpWocjd1V2ZHeXJUamM Google doc for study case (internal for EGI)]<br> | ||
*[https://wiki.egi.eu/wiki/File:EISCAT-3D_architecture.JPG EISCAT-3D architecture (image file)]<br> | |||
'''External tools and presentations (could be useful)''' | |||
*[http://cds.cern.ch/record/1512927 The IT Challenges For Research Infrastructures In Physics] presentation (CRISP ESFRI cluster project) | |||
'''ENVRI wiki''' | |||
[[ | *[http://envri.eu/group/envri/wiki/-/wiki/Main/EISCAT_3D EISCAT-3D description ] | ||
*[http://envri.eu/group/envri/wiki/-/wiki/Main/Analyse%20Common%20Requirements%20for%20Data%20Processing Analyse Common Requirements for Data Processing ] | |||
*[http://envri.eu/group/envri/wiki/-/wiki/Main/Iceland%20Volcano%20Use%20Case Iceland Volcano Study Case ] | |||
[[Category:EGI_ENVRI]] | [[Category:EGI_ENVRI]] |
Revision as of 15:42, 12 August 2014
General Project Information
- Leader: Malgorzata Krakowian (malgorzata.krakowian @ egi.eu)
- Start Date: 1.02.2013
- Meetings: Indico page (slides and minutes from meetings)
- Official webpage
Mailing lists:
- envri-eiscat3d @ mailman.egi.eu - for EISCAT-3D study case
Motivation
A study case was set up to identify existing services and solutions from EGI that could address the data pre-processing, post-processing, and publishing needs of these two ESFRI projects. The outcome of the pilot is expected to be directly applicable to EISCAT_3D, and indirectly applicable to other ESFRIs of ENVRI. In cooperation with EISCAT-3D representatives in ENVRI, EGI.eu will try to find the most suitable solutions for pre-processing primary data and post-processing it for publication.
Outcome
The design of the next-generation incoherent scatter radar system, EISCAT_3D, opens up opportunities for physicists to explore many new research fields. It also introduces significant challenges in handling large-scale experimental data, which will be generated at high speed and in large volumes. This challenge is typically referred to as a big data problem and requires solutions beyond the capabilities of conventional database technologies. To identify existing and new services that can tackle the EISCAT_3D big data challenge, a collaboration was formed in February 2013 among EISCAT_3D, EGI and the EUDAT infrastructures under the ENVRI project.
Phase 1
A 'Towards a Big Data Strategy for EISCAT-3D' document is emerging from the collaboration and it outlines a project that would take the first steps towards defining the EISCAT_3D big data strategy.
- 'Towards a Big Data Strategy for EISCAT-3D' presentation (pdf) during 16th EISCAT International Symposium 2013
Phase 2
The following questionnaires were used to collect requirements from EISCAT data managers and scientists:
For scientists: https://www.surveymonkey.com/s/ENVRI-EISCAT_Scientists
For data managers: https://www.surveymonkey.com/s/ENVRI-EISCAT_Data_managers
Responses: https://documents.egi.eu/public/ShowDocument?docid=1983
Phase 3
Development and pilot deployment of OSGC - OpenSource Geospatial Catalogue
OSGC is an Open Source implementation of an OpenSearch GeoSpatial Catalogue compliant with the OGC 10-32r3 specification, developed by EGI.eu (http://www.egi.eu/) under the ENVRI (http://envri.eu/) project.
OSGC provides a catalogue engine built on top of a PostgreSQL+PostGIS database, which exposes a customizable OpenSearch interface. Most of the application configuration can be set from the Admin web interface, while Data Administrators have a separate Dropbox interface, which eases the management of the catalog and the data storage, and a Data Gateway interface, which controls access to data and produces data access statistics.
A stand-alone client web interface, written in HTML5 and JavaScript, can be used to query this catalog and other compliant OpenSearch catalogs.
Features
- OpenSearch catalogue engine with customizable output formats, products metadata, query schema, input formats (for ingestion).
- Web admin interface (to offer the catalog as a Platform-As-A-Service on the Cloud).
- Dropbox, to automatically extract metadata, register it into the catalogue and optionally push the data file into Cloud or other connected storage.
- Data Gateway interface, to control access to data, produce data access statistics and bridge non-HTTP protocols.
- OpenSearch web client interface, which can be run remotely or as a standalone application (for integration into Cloud Virtual Laboratories PaaS services) and supports cumulative download (with shopping-cart functionality).
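As an illustration of how a client might query an OpenSearch catalogue of this kind, the following minimal Python sketch expands an OpenSearch URL template that uses the geo and time extension parameters (`{geo:box}`, `{time:start}`, `{time:end}`). It is not taken from the OSGC code base: the endpoint `catalogue.example.org`, the query parameter names (`q`, `bbox`, `start`, `end`, `format`), the helper `fill_template` and the sample coordinates are all assumptions made for the example. A real client would read the template from the catalogue's OpenSearch description document.

```python
import re
from urllib.parse import quote

# Hypothetical URL template, in the style an OpenSearch description
# document would advertise. A trailing "?" marks an optional parameter.
TEMPLATE = (
    "https://catalogue.example.org/search"
    "?q={searchTerms?}&bbox={geo:box?}"
    "&start={time:start?}&end={time:end?}&format=atom"
)


def fill_template(template: str, values: dict) -> str:
    """Expand an OpenSearch URL template with the given parameter values.

    Optional parameters without a value become empty strings;
    required parameters without a value raise KeyError.
    """
    def substitute(match: re.Match) -> str:
        name = match.group(1)
        optional = match.group(2) == "?"
        if name in values:
            # Keep commas and colons readable in bounding boxes and timestamps.
            return quote(str(values[name]), safe=",:")
        if optional:
            return ""
        raise KeyError(f"missing required parameter: {name}")

    return re.sub(r"\{([^{}?]+)(\??)\}", substitute, template)


# Illustrative query: a bounding box (west,south,east,north) and a time window.
url = fill_template(
    TEMPLATE,
    {
        "geo:box": "19.0,69.0,21.0,70.0",
        "time:start": "2014-01-01T00:00:00Z",
        "time:end": "2014-01-31T23:59:59Z",
    },
)
print(url)
```

The resulting URL can then be fetched with any HTTP client. Keeping template expansion separate from the HTTP request makes it possible to reuse the same helper against other compliant OpenSearch catalogues, whose templates may use different query parameter names.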
Members
- (Study case leader & EGI Operations Officer) Malgorzata Krakowian malgorzata.krakowian [at] egi.eu
- (Technical Outreach Manager) Gergely Sipos gergely.sipos [at] egi.eu
- (User Community Support Officer) Nuno Ferreira nuno.ferreir [at] egi.eu
- (EGI Chief Operations Officer) Tiziana Ferrari
- Dr. Ingemar Haggstrom Ingemar.Haggstrom [at] eiscat.se
- Yin Chen ChenY58 [at] cardiff.ac.uk
- Malcolm Atkinson
- Alex Hardisty
- Yannick Legré
- Paul Martin
- Alun Preece
- Ari Lukkarinen ari.lukkarinen [at] csc.fi
- Antti Pursula
- Ville Savolainen
Resources
External tools and presentations (could be useful)
- The IT Challenges For Research Infrastructures In Physics presentation (CRISP ESFRI cluster project)
ENVRI wiki