Task
|
Contact
|
Short description
|
Data description
|
Standards and metadata
|
Data sharing
|
Archiving and preservation
|
SA2.1/ SA2.2
|
gergely.sipos@egi.eu
|
Feedback and requirements from existing and new EGI users are collected at training events and through other face-to-face and electronic interactions. These data must be stored, managed, analysed and used efficiently because they are of high value to the EGI community in evolving its service portfolio.
|
- Types of data: survey data, comprising textual data, structured data (typically CSV or XLS) and graphics (usually survey summaries or analyses)
- Origin of data: collected from existing and potential users of EGI
- Scale of data: a few MB per year
|
The data are not in any standard format.
|
- Target groups: technology providers and the service developer and provider teams who contribute to the EGI service portfolio
- Scientific Impact: used for the further development of IT services offered by the EGI Community. These services are often the result of technological R&D and the subject of publications in conference proceedings and peer-reviewed journals
- Approach to sharing: A public version of the collected requirements will be shared in the EGI-Engage milestones and deliverables. The most important documents in this respect will be: M6.5 Joint training program for the second period (M15, May 2016), and the intermediate and annual project reports (every 6 months)
|
Based on the nature of the data these can be:
|
SA2.3
|
kimmo.mattila@csc.fi
|
No scientific data will be generated within the EGI ELIXIR competence centre; however, ELIXIR, as an infrastructure, does manage life science data produced by life scientists.
|
- Types of data: life science data, specifically genomics data: marine metagenomics, plant genomics and phenotype data, and sensitive human data
- Origin of data: produced and submitted by scientists. ELIXIR repositories collect, integrate and provide access to the data.
- Scale of data: The biggest data collections in life sciences are in the order of petabytes (PB); however, it is likely that the ELIXIR CC will work with smaller data sets. The raw data for a single whole human genome is roughly 200 GB.
|
Some standards, such as the standard formats in the marine and plant domains, are still under development. Some of the standards for capturing and exchanging genomic data that might be used in the use cases are described in BioSharing [R3]. Part of the data may be stored in public data repositories (e.g. ENA) that have clearly defined metadata models.
|
- Target groups: researchers interested in submitting or using metagenomics, plant and human data.
- Scientific Impact: scientific discoveries such as comparative environmental metagenomic analyses or finding genes related to a disease
- Approach to sharing: ELIXIR promotes open data access, but human data may be sensitive and therefore require authorised access.
|
Services for archiving and preservation within ELIXIR are listed in https://www.elixir-europe.org/services.
|
SA2.5
|
Alexandre Bonvin (a.m.j.j.bonvin@uu.nl)
|
|
- Types of data: Research data are involved in the activity, but they are produced by other EU projects rather than with EGI-Engage resources. The types of data produced by those projects are experimental NMR, X-ray, SAXS and cryo-EM data.
- Origin of data: Biological samples (owned by the end users of the facilities).
- Scale of data:
|
The end results are typically deposited into public databases such as the PDB or, for cryo-EM data, the EMDB.
|
- Target groups: The raw data are usually so complex that they are only of use to expert users in structural biology who have been trained in a specific technique. The processed and derived data typically deposited in public databases are of use to researchers in life sciences in general and to biotech and pharmaceutical companies.
- Scientific Impact: This research data can underpin scientific publications.
- Approach to sharing: Data are shared via databases (e.g. PDB, EMDB), possibly with an embargo period until publication. Other datasets (e.g. the results of computations) can be shared via EUDAT or other repositories such as SBGRID for structural biology. For an example, see: https://data.sbgrid.org/dataset/131/
|
From a university perspective, data are to be kept for 10 years. Currently, there is no proper archiving mechanism in place at the particular site (Utrecht University). At the moment, policies and services rely on what is provided by the database service providers where data are deposited.
|