data infrastructures
framework for action for H2020
Event “Open Access Policy in Portugal”
Lisbon, 17 June 2013
Carlos Morais Pires European Commission
e-Infrastructures, DG CNECT.C1
Author’s views do not commit the European Commission
summary
• Policy framework related to data infrastructures
• Data as Infrastructure: Europe is "Riding the Wave"
• Implementing Interoperable Data Infrastructure
• balancing community driven and service driven initiatives
• Data Infrastructure in FP7 (examples of project initiatives)
• H2020 workprogramme under construction
• Main Messages to conclude
Policy context
A Reinforced European Research Area Partnership for Excellence and Growth, COM(2012) 392 – July 2012
Towards better access to scientific information: boosting the benefits of public investments in research, COM(2012) 401 final – July 2012
Commission Recommendation on access to and preservation of scientific information, C(2012) 4890 final – July 2012
[Figure: status of national policies in four areas: OA Publications, OA Data, Preservation, eInfrastructures. Legend: national policy not formulated; national policy formulated but not implemented; national policy formulated, implemented but no outcomes yet; national policy formulated, implemented, and outcomes delivered]
Better Access to Scientific Information
slide from Neil Jacobs (JISC) presentation to the EC
data as infrastructure: Europe is Riding the Wave
The High Level Expert Group on Scientific Data presented Riding the Wave in October 2010
Vision: "data e-infrastructure that supports seamless access, use, re-use, and trust of data. In a sense, the physical and technical infrastructure becomes invisible and the data themselves become the infrastructure – a valuable asset on which science, technology, the economy and society can advance".
useful definitions
Data: digital recorded factual material commonly accepted in the scientific community as necessary to validate research findings
(does not include lab notebooks, preliminary analysis, drafts of scientific papers, plans for future research, peer review reports, communication with peers, physical objects, lab specimens)
[cf. White House Memo on "Increasing Access to the Results of Federally Funded Scientific Research"]
Data infrastructures: services, applications, tools, knowledge and policies for research data to be
discoverable, understandable, accessible, preserved and curated… and available 24/7
implementing interoperable data infrastructure
(a) data generators: research projects, big research infrastructures, installations or medium size laboratories, simulation centres, surveys or individual researchers
(b) discipline-specific data service providers, providing data and workflows as a service
(c) providers of generic common data services (computing centres, libraries)
(d) researchers as users, using the data for science and engineering
community driven data infrastructure, including ESFRI, ESFRI clusters and others
data infrastructures in FP7
CNECT:
96 Meuro of EC contribution
5 Calls for proposals
First two calls (45 M): probing the European Research Data Space
Third call (4 M): FP7 OA Pilot/OpenAIRE
Fourth call (45 M): structuring the European Research Data Space along the Riding the Wave strategy
Fifth call (2 M): iCORDI
Other projects closely related to data infrastructures were funded in other parts of the programme (~80 Meuro):
distributed computing, grids, virtual research environments, earth-server, …
thematic distributed data infrastructures in FP7
RTD:
Topics targeting thematic distributed data infrastructures or thematic networks of RI providing data services were included in all the five FP7 RTD Calls for proposals
More than 170 Meuro of EC contribution
Preparatory Phase Projects: ELIXIR, ISBE, ICOS, LIFEWATCH, CLARIN, DARIAH, CESSDA, …
Implementation clusters: DASISH, BIOMEDBRIDGES, ENVRI, CRISP
Integrating Activities: SEADATANET II, UP-GRADE-BS-SCENE, ACTRIS, NERA, IS-ENES, INGOS, JERICO, SLING, BBMRI-LPC, DwB, INGRID, ARIADNE, CENDARI, EHRI, …
ERANET and Policy support measures: SIM4RDM, COOPEUS, CREATIVE-B, DARECLIMED
Domains: Life Science, Environment, SSH, Other
data infrastructure: bridging islands
[Figure: network infrastructure (GÉANT); distributed computing/software infrastructure; scientific data infrastructure; bridges]
Science. Set Free.
Research results. Linked.
Open. Share. Re-use.
Scientific Information Infrastructure
OpenAIRE - information pages OA
• The National Research Environment (research institutions, funding)
• Open Access and Repositories (awareness, repositories, journals, organizations)
• Contact details of the Open Access Desk
http://www.openaire.eu/en/nlo/country-information.html
22 more countries…
OpenAIRE: support to research metrics
• Data driven research across disciplinary and geographical boundaries
• Register relevant data objects stored in certified repositories
• Virtually integrate data objects in trusted federations
• Foster advancements in interoperability of object content
• Fragmentation and heterogeneity of data require standardization
[Figure: EUDAT Scenario, Collaborative Data Infrastructure. Labels: European Data Centers, DESY, ARGO, MetaNet, INCF, Health eChild]
OA Publication Infrastructure
Open Data Infrastructures
service-driven data e-infrastructures
community-driven data e-infrastructures
SCIDIP-ES (Earth Observation Long Term Data Preservation)
Adapted from a slide of Dr. Mirco Albani (ESA), project leader of SCIDIP-ES
community-driven data e-infrastructures
The Virtual Observatory concept is a bold community-led response to the challenges the astronomical community faces in data management and storage.
Impressive progress has been made and the momentum of the International Virtual Observatory Alliance will ensure sustained progress.
Implementation Cluster for SSH: DASISH
Provides solutions to a number of common issues for the five projects in social sciences and humanities
The projects work together along four major areas of common concern: data quality, data archiving, data access, and legal and ethical issues
The outcome of this work will form the basis for educational activities and for outreach to the communities of researchers that will benefit from these infrastructures
Implementation Cluster for Life Science: BioMedBridges
All ESFRI Life Sciences infrastructures, coordinated by EMBL
Interoperability across data sources and services
EU funding: 10.5 M€, started in 2012
Infrastructures: EATRIS, EMBRC, EU-Openscreen, ECRIN, Euro-Bio-imaging, BSL4, BIOBANKS-BBMRI, EBI-ELIXIR, INFRAFRONTIER, INSTRUCT
Implementation Cluster for Environment: ENVRI
Development of a common reference model, standards, and common components for data pre-processing and post-processing
Contribution to GEOSS (Global Earth Observation System of Systems) and compliance with INSPIRE EC Directive
Large participation of ICT and e-infrastructures actors (key partners from D4SCIENCE, GENESI, EGI, EUDAT, PRACE…)
EU funding: 3.7 M€, started in 2011
Infrastructures: EPOS, EURO-ARGO, ICOS, SIOS, EMSO, EISCAT, LIFEWATCH
Implementation Cluster for Physics, Astronomy: CRISP
All ESFRI Physics, Astronomy and Analytical infrastructures, coordinated by ESRF
Seeking synergies between 11 ESFRI Projects totalling more than 9 b€ investment volume
16 project partners from 12 Member States with a total operating budget of 1.5 b€/y
Infrastructures: SKA, ELI, XFEL, EUROFEL, ESRF, ESS, ILL upgrade, SPIRAL2, SLHC, ILC-HiGrade, FAIR
Research Data Alliance:
Common Infrastructure, Policy and Practice
Drives Data Sharing and Exchange throughout the Data Life Cycle
From Prof. Fran Berman and Prof. John Wood, Members of the RDA Council
consultation towards horizon2020
H2020 workprogramme under construction
• Community data services
• E-Infrastructure for Open Access
• Managing, preserving and computing with big research data
• Towards global data e-Infrastructures
• Skills and professions for e-infrastructures
• Integration of Core and Basic Operations Services for e-Infrastructures
• e-Infrastructures for virtual research environments (VRE)
• Centres of Excellence for computing applications
• PRACE
• Network of Competence Centres for SMEs
• GÉANT
These lines are related to the content of the Framework for Action
main messages to conclude
• Research Data "is" an Infrastructure for modern science
• Data is generated and used by disciplinary communities
• Data is stored, moved and processed by common infrastructures
• Crossing disciplinary and geographic boundaries requires:
  • exploring the commonalities of data infrastructures
  • implementing global and interoperable data infrastructures
• Policies for Open Access remove, where possible, barriers to accessing and sharing data
• H2020 will make OA to publications the rule
• H2020 will start a pilot on OA to publicly funded research data
Carlos Morais Pires