DATA MANAGEMENT PLAN IN THE REAL LIFE SCIENCES

Academic year: 2021

Yvan Le Bras, Cyril Monjeaud, Olivier Collin, Jacques Nicolas

(2)

Context

Kahn. On the future of genomic data. Science (2011) vol. 331 (6018) pp. 728-9

• Now : Genomics (Next Generation Sequencing)
• Now : Proteomics
• Next : Bio-imaging
• Digital data
• Huge amount
• Heterogeneous
• Critical situation for some laboratories

(3)

Context

• Exchange from one domain to another

• From ICT / IT to scientific domains

• Between scientific domains

(4)

E-BIOGENOUEST

From the Biogenouest project to the first French e-Science center : CeSGO

(6)

E-Biogenouest

• Started in May 2012 for 3 years

• Funded by Brittany and Pays de la Loire

• E-science initiative for the Biogenouest network

• Test an e-Science approach

• Roadmap preparation

• An innovative VRE concept
• More than 120 scientists trained!
• More than 200 users!
• 1669 meetings ;)
• Mission interdisciplinarité CNRS, PIA, IFB, France Génomique, Rapsodyn, Citizen science, UEB C@mpus, CPER, FRM, INCa, H2020
• Health, IT, Agro, Environment
• 7 submitted publications

(7)

VRE: a tool for e-Science application

Virtual Research Environment

• Web portal
• Data
• Software
• Processing resources
• User community
• Collaboration

(8)

An innovative VRE approach

• Research Lifecycle

• Open source solutions

• Don’t reinvent the wheel

http://www.jisc.ac.uk/whatwedo/campaigns/res3/jischelp.aspx#simulate

Mutualise

win win

(9)

Community

HUBzero

Galaxy, EMME

Data management & analysis continuum

(10)

HUBzero : Scientific collaborative platform

eBGO HUB

HUBzero to share knowledge and manage groups and projects

Information

- 218 users
- 111 projects
- 53 groups
- 729 resources
- > 400 unique users per month …

Purdue University

(11)

ISAtools : Experimental data management

EMME

ISAtools suite to store data & metadata

Functionalities

- based on biomedical ontologies
- bridge between existing biomedical standards
- formatting for publication submission
- Pydio to upload data
- biological investigation repository (data + metadata)

Oxford eResearch Centre
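
As a rough illustration of the "data + metadata" repository idea, the sketch below models the three levels that give the ISA tools their name (Investigation, Study, Assay) as plain Python dataclasses. The class and field names are assumptions made for this example; this is not the ISA tools API and not the EMME schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Assay:
    # One measurement performed within a study (hypothetical fields).
    measurement_type: str
    technology_type: str
    data_files: List[str] = field(default_factory=list)


@dataclass
class Study:
    # A unit of research that groups related assays.
    identifier: str
    title: str
    assays: List[Assay] = field(default_factory=list)


@dataclass
class Investigation:
    # Top-level container: the overall project context.
    identifier: str
    title: str
    studies: List[Study] = field(default_factory=list)


def list_data_files(investigation: Investigation) -> List[str]:
    """Collect every data file referenced anywhere in the investigation."""
    return [
        name
        for study in investigation.studies
        for assay in study.assays
        for name in assay.data_files
    ]


# Illustrative values only; none of them come from the slides.
example = Investigation(
    identifier="INV-1",
    title="Example investigation",
    studies=[
        Study(
            identifier="STU-1",
            title="Example study",
            assays=[
                Assay("transcription profiling", "nucleotide sequencing",
                      ["sample_a.fastq", "sample_b.fastq"]),
                Assay("protein expression profiling", "mass spectrometry",
                      ["run_1.raw"]),
            ],
        )
    ],
)

print(list_data_files(example))
```

Keeping the file names attached to the assay that produced them is what lets a repository answer "which data belong to which experiment" without relying on folder names.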

(12)

Galaxy : Data analysis web platform

GALAXY by GenOuest

To analyse & share data as processes and tools

Information

- 34917 jobs
- 150 users
- More than 800 tools

Share

- data
- histories
- workflows
- tools

Penn State University

(13)

Pydio : File sharing platform

Pydio by GenOuest

To store & share data as links

Information

- Galaxy workspace
- EMME workspace
- INCa workspace

Share

- data via URI
- control
- safety
- privacy

Abstrium SAS

(14)

• For society
  - Open Science and open data

• For end-user scientific communities
  - Data management plan
  - Preserve, access, share & visualise (data & analytics processes)
  - Help for project management

• For ICT
  - Facilitate the use of tools (Research Service)
  - Accelerate the switch from development to production
  - Optimise infrastructure use (storage, computing & network…)
  - Infrastructure for data, infrastructure of data

(15)

DMP ON THE LINE

(16)
(17)
(18)

Data storage

(19)
(20)
(21)

Metadata management

(43)

Data analysis

(54)

Metadata repository

(57)

CeSGO & DMP

• Administrative data
  - Project name
  - Project description
  - Name / ID of the person in charge
  - Funding agency
  - DMP version

• Policy applied to the data

• Responsibilities and resources

• Data collection / creation
  - Dataset description
  - Protocol
  - Method
  - Equipment
  - Quality assurance applied

• Documentation and metadata
  - Bii repository

(58)

CeSGO & DMP

• Data storage, backup and security
  - CeSGO datacenter for the duration of the project (max : 5 years)

• Ethics and legal framework
  - Protection of sensitive or personal data
  - CC version 4.0

• Data sharing
  - Open or restricted access
  - Deadline : at most 3 years after collection
  - Repositories (GEO, GenBank, SRA, UniProt, PRIDE, …)
  - Tools needed to reuse / validate the data
  - Data paper
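
The two numeric limits on these DMP slides (storage for at most 5 years, sharing at most 3 years after collection) lend themselves to an automatic check. The sketch below is one possible way to encode a plan and test it against those limits; the class, field names and sample values are assumptions for illustration and do not describe the actual CeSGO tooling.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Tuple

MAX_STORAGE_YEARS = 5        # slide: datacenter storage, max 5 years
MAX_SHARING_DELAY_YEARS = 3  # slide: share at most 3 years after collection


@dataclass
class DataManagementPlan:
    # Hypothetical field names loosely following the slide headings.
    project_name: str
    responsible_id: str
    funding_agency: str
    dmp_version: str
    collection_date: date
    planned_sharing_date: date
    storage_years: int
    access: str = "open"          # "open" or "restricted"
    licence: str = "CC 4.0"
    repositories: Tuple[str, ...] = ()


def add_years(day: date, years: int) -> date:
    """Shift a date by whole years, mapping 29 February to 28 February."""
    try:
        return day.replace(year=day.year + years)
    except ValueError:
        return day.replace(year=day.year + years, day=28)


def check_plan(plan: DataManagementPlan) -> List[str]:
    """Return a list of human-readable problems; an empty list means no issue found."""
    problems = []
    if plan.storage_years > MAX_STORAGE_YEARS:
        problems.append("storage exceeds 5 years")
    deadline = add_years(plan.collection_date, MAX_SHARING_DELAY_YEARS)
    if plan.planned_sharing_date > deadline:
        problems.append("sharing planned more than 3 years after collection")
    if plan.access not in ("open", "restricted"):
        problems.append("access must be 'open' or 'restricted'")
    if not plan.repositories:
        problems.append("no target repository named")
    return problems


# Illustrative plan; every value here is made up.
plan = DataManagementPlan(
    project_name="Example project",
    responsible_id="PI-001",
    funding_agency="Example agency",
    dmp_version="1.0",
    collection_date=date(2015, 3, 1),
    planned_sharing_date=date(2017, 9, 1),
    storage_years=3,
    repositories=("SRA",),
)
print(check_plan(plan))
```

Returning a list of problems rather than raising on the first one lets a project manager see everything that needs fixing in a single pass.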

(59)

CESGO: 5 GOALS

(60)

CeSGO : Western France e-Science

Data management

Life sciences protocols

Metadata

(62)

CeSGO : Western France e-Science

• New VREs!

• Connected using semantic web approaches

• Thanks to DOI attribution

(63)

CeSGO : Western France e-Science

Reproducibility

cloud

docker

Galaxy

versioning

(64)

CeSGO : Western France e-Science

Accessibility

wiki

Public resources

Analytics processes Publications Experiments

(65)

Thank you for your attention

eBGO HUB (collaboration) http://www.e-biogenouest.org/

Scitizen portal (citizen science) http://scitizen.genouest.org

EMME portal (data management) http://emme.genouest.org/

Galaxy instance (data analysis) http://galaxy.genouest.org/

GO4Bioinformatics (education) https://www.e-biogenouest.org/einfrastructure/education

(66)

CeSGO : Western France e-Science
