• No results found

Workshop : Open and Big Data for Life Imaging

N/A
N/A
Protected

Academic year: 2021

Share "Workshop : Open and Big Data for Life Imaging"

Copied!
19
0
0

Loading.... (view fulltext now)

Full text

(1)

Workshop  :  Open  and  Big  Data  

for  Life  Imaging  

Chris'an  Barillot  

Michel  Dojat  

(2)

Many  Good  Reasons  

for  Sharing  Data  and  Tools  in  In  Vivo  Imaging    

• 

Scien'fic  

«  Power  failure:  why  small  sample  size  undermines  the  reliability  of  neuroscience  »  

Bu9on  et  al.  Nat.  Neurosc  2013  

«  Why most published research findings are false” Ioannidis Plos Med 2005

“Data  from  preclinical  animal  studies  appear  to  be  associated  with  even  greater  bias  »    

Ioannidis  ScienceTranslaDonalMed  2012,    

«  Why  small  low-­‐powered  studies  are  worse  than  large  high-­‐powered  studies  and  how  to  protect   against  trivial  findings  in  research  »  Ingre  Neuroimage  2012  

-­‐

Underpowered  is  endemic

 

(

false  posi've,  false  nega've

)  

(3)

March  2015   FLI-­‐IAM   3  

-­‐

Data  analysis  and  reporDng  are  oKen  selecDve  and  biased  

Wicherts  et  al.  PLOS  One  2011   N=28   N=21   Errors  in   repor'ng     sta's'cal     results   Data  shared   Data  not  shared  

At  Least  3  ….  

• 

Scien'fic  

Many  Good  Reasons  

(4)

Large  cohort,  Re-­‐analysis,  ReplicaDon,  MetaAnalysis    

«  collecDve  mind  »  

Fox  et  al.  Ann  Rev  Neurosci  2014  

«  crowd  science  »

 Franzoni  &  Suerman  Res  Policy  2014  

-­‐

Rarely  replicated  so  false  soluDons  persist  

-­‐

New  quesDons,  increasing  the  speed  of  discovery  

• 

Scien'fic  

Many  Good  Reasons  

for  Sharing  Data  and  Tools  in  In  Vivo  Imaging    

(5)

• 

Economic  

March  2015   FLI-­‐IAM   5  

Data  Acquisi'on:  e.g.  neuroimaging  300€/H  30  subjects  x  2  groups  18  K€  …       Many  efforts  for  image  processing  tools  development  and  valida'on  

dont  replicate  exis'ng  data  and  tools  ….  

Reducing  cost  doing  science,  maximize  investment  

• 

Ethic  

Healthy  subject  and  pa'ents  give  their  consent  for  contribu'ng  to  knowledge   advancement.  

For  preclinical  studies,  ethical  obliga'on  to  ensure  that  the  animals  are  not  wasted.  

Maximize  their  parDcipaDon  

Three  Good  Reasons  

(6)

Ethic  &  Regula'on  

• 

Scien'fic  

How  can  I  have  a  return  on  investment  ?  What  direct    benefit  I  will  get  ?   Who  benefits  from,  when,  and  how?    

Degree  of  control,  fear  of  adverse  use  or  flawed  interpreta'on  or,  exposi'on  of  some   errors  

«  Too  much  effort  »,  appropriateness  of  others’  data    

Wallis  et  al.:      »  goods  are  bartered  between  trusted  colleagues  rather  than  treated  as  

commodi'es  »  PLoS  ONE  2013  

Economic  

Three  Good  Reasons  

(7)

March  2015   FLI-­‐IAM   7  

Ethic  

Scien'fic  

• 

Economic  

The  cost  of  sharing.  Data  should  be  kept  and  curated.     Who  will  make  the  investments?  

Which  Bussiness  model?  

Van  Horn  &  Gazzaniga  «  Have  a  plan  for  what  happens  AFTER  your  grant  funding  runs  

out  »  Neuroimage  2013  

Three  Good  Reasons  

(8)

Three  Good  Reasons  

for  refraining  Sharing  Data  and  Tools  ….  

• 

Ethic  &  Regula'on  

Who  is  the  data  owner  ?  use  data  for  new  non  ini'ally  expected  applica'ons   Have  images  some  specifici'es  ?  (see  UKBioBank)  

Research  subjects  vs  pa'ents  

Securisa'on:  poten'al  failure  in  the  authen'fica'on  system  =>    

       Balance  risk  &  security  level  &….    Bypassing  the  rules    

Scien'fic  

(9)

Researchers,  journals  and  funding  agencies  must  work  together  to  

iden'fy   and   implement   technical   solu'ons   that   allow   the   most  

effec've   data   sharing   without   greatly   increasing   the   burden   on  

researchers.    

 

(10)

Researchers,  journals  and  funding  agencies  must  work  together  to  

iden'fy   and   implement   technical   solu'ons   that   allow   the   most  

effec've   data   sharing   without   greatly   increasing   the   burden   on  

researchers.    

 

Poldrack  and  Gorgolewski  Nat  Neuro  2014  

The   slow   adop'on   of   tools   and   services   such   as   data   repositories  

are   indica'ons   that   technology   alone   cannot   change   scienDsts’  

pracDces;   social   and   cultural   factors   must   also   encourage  

datasharing….  Much  sharing  is  private  rather  than  public.  

 

(11)

We   conclude   that   research   data   cannot   be   regarded   as  

knowledge   commons,   but   research   policies   that   be9er  

incenDvise  data  sharing  are  needed  to  improve  the  quality  of  

research  results  and  foster  scien'fic  progress.  

(12)

We   conclude   that   research   data   cannot   be   regarded   as  

knowledge   commons,   but   research   policies   that   be9er  

incenDvise  data  sharing  are  needed  to  improve  the  quality  of  

research  results  and  foster  scien'fic  progress.  

Fecher  et  al.  PLOS  One  2015  

Impediments   regarding   sharing   supplementary   data   could  

include   clear   and   elaborate   reasons   to   opt   out.   In   order   to  

remove   risk   aversion   and   ambiguity,   an   understandable   and  

clear  legal  basis  regarding  the  rights  of  use  is  needed  to  in-­‐  

form  researchers  on  what  they  can  and  cannot  do  with  data  

they  collected.  

(13)

• 

Ins'tu'onal  aspects    

March  2015   FLI-­‐IAM   13  

What  are  the  ins'tu'onal  ac'ons  to  promote  data  sharing?     Which  Economical  models?  

What  credit  for  data  and  tools  sharing  ?  

 ChrisDne  Balagué  (CCNum,  FR)    

• 

Legal  and  ethics  ques'ons  

What  can  be  shared?  To  whom  data  own?  Status  of  data,  metadata  and  derived  data?   What  are  the  requirements  for  sharing?      

Paul  Oliver  Gibert  (Digital  Ethics,  FR)    

 

• 

Context  of  the  workshop  

Introduc'on  to  FLI-­‐IAM  project  

ChrisDan  Barillot,  Michel  Dojat,  Michael  Kain  (FLI-­‐IAM,  FR)      

An  Open  Discussion  

(14)

An  Open  Discussion  

• 

Pros  and  cons  of  working  architectures      

posi've  and  nega've  aspects  of  sharing  to  large  scale  in  prac'ce?  user  acceptance   Efforts  for  sharing.    

• 

Technical  aspects    

What  are  the  exis'ng  solu'ons?  What  are  the  main  technical  challenges?    

 P.  Mouillard  (Vigisys,  FR)  

 D.  Kennedy  (INCF,  US)  

Wiro  Niessen  (Ronerdam  cohorte,  NL)    

(15)
(16)

Round  Table  

• 

Prior  Issues  

– 

Standards  for  sharing  data  

• 

Do  we  need  standard  for  data  model  (e.g.  ontologies)  ?  

• 

Standard  for  raw/derived  data  format  ?  

• 

Do  we  need  standard    for  interoperability  (on  concepts,  on  

communica'on)  ?  

• 

wait  for  standards  or  go  forward  ?

 

– 

Quality  control  in  Data  Sharing  

• 

Ra'o  cost  /  added  value  

(17)

Round  Table  

• 

Infrastructure  Issues:  

– 

Data  bases:  

• 

Big  Data  Centers  vs  Distributed  Storage  vs  Peer  to  Peer  storage  

(bioTorrent)  ?  

– 

High  Performance  Compu'ng  (HPC)  and  Big  Data  for  in  

vivo  imaging  

• 

Is  HPC  infrastructure  generic  or  specific  to  usages?  

• 

Cloud  compu'ng:  Clusters  vs  Grids  vs  Crowd  Compu'ng  ?  

– 

Image  Processing  Code  Sharing  ?  

• 

Is  it  different  than  Data  Sharing?  

(18)

Round  Table  

• 

Socio-­‐economic  issues:  

– 

Data  Sharing  credit  ?  

• 

Co-­‐authorship  vs  cita'on  vs    payment  ?  

• 

Data  rights  

– 

Economic  model  

•  Cost  for  using  shared  data  ?   •  Who  support  the  cost  ?    

•  Can  data  sharing  become  a  business?  

– 

Is  regula'on  /  ethics  a  real  limit?  

– 

How  funding  agencies/ins'tu'ons  can  help  to  promote  

data  sharing?  

(19)

Round  Table  

• 

Finally,  Bonlenecks  are:  

– 

Socio-­‐economical?  

Or  

References

Related documents

ECP Project Management Structure Board of Directors Science Council Industry Council Project Director Deputy Director CTO Integration Manager D ep ar tm en t o f En er g y

To reduce the grounding resistance, either more land is required or the resistivity of the available land soil must be lowered.. Soil resistivity can be lowered, and the related cost

Over the course of one month, formal interviews were conducted with 18 intravenous drug users, two medical doctors specializing in infectious disease, and one Traditional

for HPV-18, DLG1 was present at cell borders in cultures expressing HPV-11 proteins, and this may 301. be relevant considering the different capacity of each virus in

The distributions on the left cause all of the test statistics to have greatly inflated type-I error rates, while the distributions towards the right result in type-I error

In this paper, we use Attanssov’s idea and establish the interval-valued intuitionistic fuzziffication of notion of (strong, weak, s-weak) hyper BCK-ideals of hyper BCK-algebras

cerebrovascular disease 13,19 led us to hypothesize that the STC-1 concentration in CSF could be altered in AD and/or other neurodegenerative diseases. We found that: 1) STC-1