• No results found

The ISPS Data Archive: Mission, Work, and Some Reflections

N/A
N/A
Protected

Academic year: 2021

Share "The ISPS Data Archive: Mission, Work, and Some Reflections"

Copied!
20
0
0

Loading.... (view fulltext now)

Full text

(1)

The ISPS Data Archive:

Mission, Work, and

Some Reflections

(2)
(3)

Limor Peer | Yale University | April 2016

Archive Embedded in ISPS Website

(4)

ISPS Data Archive: Basic Facts

• A digital repository for research produced by scholars affiliated with

ISPS, with special focus on experimental design and methods.

Associated data are primarily quantitative and gathered from a

combination of administrative records, surveys, and observations.

• The Archive accepts research outputs underlying scientific

publications including data, code, output, documentation, and other

research artefacts.

• The Archive provides free and public access to research materials and

accepts content for distribution only under a Creative Commons

license.

• The Archive was launched in September 2010 as a pilot at Yale. (We

learned a lot, and still are.)

• The Archive is managed by a full time professional with knowledge of

the specific research domain and graduate student research

assistants who handle most of the data curation.

In: Curating Research Data Volume 2, (forthcoming)

(5)

ISPS Data Archive: Mission

The ISPS Data Archive was built to create

an open access digital collection of social

science experimental data, metadata,

code, and associated files produced by

ISPS researchers, for the purpose of

replication of research findings, further

analysis, and teaching.

(6)

Peer, Green, and Stephenson. 2014. Committing to Data Quality Review.

International Journal of Digital Curation, 19(1): 263-291. doi:10.2218/ijdc.v9i1.317

Limor Peer | Yale University | IASSIST 2016

Data Quality Review Framework

(7)

Curation Tasks

for Quality & Reproducibility

Performed by the ISPS Data Archive pre-publication:

1. Check for missing variable labels and value codes.

2. Review observation count

3. Identify potential data errors

4. Compare questionnaire, codebook, & data

5. Ensure there is no personally-identifiable information (PII)

6. Confirm code executes

7. Confirm code produces reported results

8. Create open formats

In: IASSIST Quarterly (January 2016)

(8)

New Curation Tool:

Curator @Yale

 Structures and tracks the curation workflow

 Helps automate parts of the review pipeline

 Captures all metadata throughout the process

 Pushes out relevant information to

pre-determined destinations

o

i.e., a user, the archive administrators, a Web based

dissemination system, or preservation systems

 Can fit into repository and research workflows

(9)

New Curation Tool:

Curator @Yale

• Leverages DDI Lifecycle

– Machine executable, open structured format supports

research transparency

– Plays a part in review tasks – study, file & variable levels

• Modular, open-source

– Could be adapted to changing needs, research methods,

dissemination platforms, and preservation solutions

– Could be used by repositories, researchers, and research

staff

– Flexible deployment and configuration options

(10)

New Curation Tool:

Curator @Yale

(11)

New Curation Tool:

Curator @Yale

(12)

New Curation Tool:

Curator @Yale

(13)

New Curation Tool:

Curator @Yale

(14)

New Curation Tool:

Curator @Yale

(15)

Reflections: What’s Next?

• Max Weber

• German sociologist, philosopher,

political economist

• Inevitable and linear trend toward

increasing rationalization,

systemization, and routinization

Limor Peer | Yale University | IASSIST 2016

Will curation for quality &

(16)

1. Research data policies & culture mature

 Wide recognition that curating for quality and

reproducibility is a valued goal

 Researchers are rewarded for quality

of materials shared (e.g., badges)

o

OSF, ACM-DL

 Gatekeeping is acknowledged as an

essential function

Limor Peer | Yale University | IASSIST 2016

Curation for Quality & Reproducibility

Will Become Routinized When…

(17)

2. It is easier and cheaper to do

It can be better automated

o

Curator tool, APIs

Computational methods evolve to

readily capture the entire workflow

and preserve it

o

R knitr, R Sweave, Docker, ReproZip

Limor Peer | Yale University | IASSIST 2016

Curation for Quality & Reproducibility

Will Become Routinized When…

Curator

@Yale

(18)

3. It is a part of active research process

http://www.thesleuthjournal.com/military-co-option-media-entertainment/

Limor Peer | Yale University | IASSIST 2016

Curation for Quality & Reproducibility

Will Become Routinized When…

(19)

1. Research data policies & culture mature

2. It is easier and cheaper to do

2. It is a part of active research process

Limor Peer | Yale University | IASSIST 2016

Curation for Quality & Reproducibility

Will Become Routinized When…

Can Curation Prevent the Next Data Sharing Disaster?

(20)

Thank you!

Limor Peer

[email protected]

@l_peer

References

Related documents

By working to increase the self-concept of students with disabilities, school counselors can help reduce the likelihood of students in Individualized Education Programs

The Group Executive Committee consists of (I) the CEO, (II) the Head of Baltic Banking, (III) the Head of Retail, (IV) the Head of Large Corporates & Institutions, (V) the

Economic policies that came about at the same time as social refonns did not specifically target marriage or family structure, but contributed to changes in

Aristonate C-5000 Neutral Calcium Sulfonate Liquid / 50% High performance corrosion inhibitor and demulsifier Calamide F Oleic Diethanolamide Liquid / 100% Water soluble

One example is how Specter’s ERP system has functions for the users (typically an e-shop owner) to forward the orders from customers to the supplier’s warehouse system,

T1 -weighted images (Figure 3) have shown that the MR signal from benign pleural plaques com- pared to muscles was hypo-intense to iso-intense in all patients

In this chapter, we examine the unauthorised movement of goods across the border to and from Indonesia's Riau Islands from the perspective of the individuals who smuggle and

The current study, conducting a formal analysis on the estimators of imputation methods, finds that the single imputation method, similar to the time-dummy method, produces