Digital Preservation and Access - The 5 Opens Project

Academic year: 2021

(1)

Less Code, More Product:

Leveraging Open Source Technologies to Develop Digital Library Collections

Robert Cartolano, Stephen Davis, Carole Ann Fabian

Columbia University

(2)

Digital Preservation & Access

“Digital preservation is the active management of digital content over time to ensure ongoing access.”

Library of Congress

“Digital preservation is a challenge for all of society because we all benefit from reliable, authentic information now and into the future. Done well, all of society will reap the benefits of digital stewardship.”

(3)

Distributed Preservation & Access, Radical Collaboration

“Long-term preservation of digital information on a scale adequate for the demands of future research and scholarship will require a deep infrastructure capable of supporting a distributed system of digital archives.”

"Preserving Digital Information: Report of the Task Force on Archiving of Digital Information," 1996

"Conditions in technology, economics, and expanding service requirements demand a more radical approach to collaboration."

"Advancing From Kumbaya to Radical Collaboration: Redefining the Future Research Library" Neal, J., Journal of Library Administration, 2011.

(4)

Strategic Challenges

•  Organization – local and distributed

•  Technical Infrastructure – durable, interoperable, flexible

•  Collections – collect, aggregate, manage,

(5)

Approach

•  Close Partnerships – develop and sustain technology platforms over time

•  Strategic Investment – local and

(6)

The Five Opens

•  Open Code – sustainable platforms

•  Open Data Standards – manage over time

•  Open API/Protocols – foster interoperability

•  Open Communities – go further together

(7)

“Open” Fosters Preservation & Access

Digital preservation and access is greatly fostered by adopting the “Five Opens” and partnering to develop local and distributed systems.

(8)

Key Benefits of Open

•  Less Code – Reduce local efforts, leverage community development

•  More Product – build better services for today

•  Contribute – local development, support entire community over time

(9)

Columbia Strategic Investments

•  Fedora – durable object repository

•  Blacklight – common discovery layer

•  Hydra – building, sharing apps & tools

•  Multiple data centers

•  Columbia private network

(10)

Implementing

(11)

Columbia Digital Collections Scope

•  Digitized special & archival collections

•  Born-digital content

•  Institutional repository content

•  Research data sets

•  Numeric data files

(12)
(13)
(14)

Challenges

•  Unique, custom code for each project

•  Deprecated and unsupported software versions

•  No longer best “best practices”

•  Passé “state of the art”

•  Divergent metadata and content models

(15)
(16)
(17)

Benefits

•  Common code base, release versions

•  Easier software upgrades and bug fixes

•  Faster implementation of new projects

•  New features shared across projects

•  Better asset management

•  Easier metadata curation

•  Ability to benefit from - and contribute

(18)

Less Code

Result of moving Columbia DL applications from custom code to Hydra framework:

ca. 10,000 lines of local code removed so far

And many 10s of thousands as we retire

(19)

More Product

Sample DL project development time, 1.5 FTE developers:

5-6 weeks

New DL project development time for comparable project, 1.5 FTE developers:

(20)
(21)
(22)
(23)
(24)
(25)

SEYMOUR B. DURST OLD YORK LIBRARY

“I don’t collect books out of nostalgia, but out of a love of information and history”

(26)

The Collection

•  Book-like materials

   •  Books & Pamphlets: 8,676

   •  Landbooks: 30

   •  Serials: 519

•  Visual materials

   •  Photographs: 3,000

   •  Thomas Nast Collection: 1,200

   •  Lantern & 35mm slides: 75

   •  Postcards: 22,594

•  Maps

   •  Oversize maps: 450

(27)

Durst project goals

Support planning

Add capacity

Implement projects to ensure broad access:

•  Catalog & house the collection

•  Digitize the public domain content

(28)
(29)
(30)

Project: data development

9,839 bibliographic records

•  3,151 copy

•  6,688 original

23,780 metadata records

•  23,780 geo-location

•  15,153 subject headings

(31)

Project: digitization

•  Acceptable copy: 1,559

•  Internet Archive: 2,997

•  Backstage: 843

•  CUL-PRD: 368

•  Avery contract photographer: 87

CUL Project digitized: 5,852

CUNY digitized: 14,559

[Chart legend: Acceptable Copy, Backstage, Avery, Internet Archive]

(32)
(33)
(34)
(35)
(36)
(37)
(38)
(39)
(40)
(41)
(42)

Image palette

(43)
(44)
(45)

Place-Based Discovery

•  Biggert Architectural Vignettes

•  New York Real Estate Brochure Collection

•  Seymour B. Durst Old York Library

(46)
(47)
(48)
(49)
(50)
(51)
(52)
(53)
(54)
(55)
(56)

Course integration

“History of Real Estate Development in New York City”

GSAPP – RED program

Summer session: 2012, 2013, 2014

~100 grad students each year

2014 summer course interactions:

•  basic instruction session (90 students)

•  reference interaction (67 students)

(57)

Symposia

GSAPP CURE: The Center for Urban Real Estate May 11, 2012

Megaprojects:

An Urban Planning and Development Conference April 11, 2013

Mind the Gap:

Transit Lessons from New York and London April 25, 2014

(58)
(59)
(60)

References

"Advancing from Kumbaya to Radical Collaboration: Redefining the Future Research Library", Neal, James, Journal of Library Administration, 1/2011, http://hdl.handle.net/10022/AC:P:20919

Blue Ribbon Task Force on Digital Preservation and Access, 2/2010, http://brtf.sdsc.edu/biblio/BRTF_Final_Report.pdf

"Challenges and Opportunities of Open Source in Higher Education", Fuchs, Ira, The Tower and the Cloud, Educause, 2008, pp. 150-157, http://net.educause.edu/ir/library/pdf/pub7202.pdf

"Definitions of Digital Preservation", ALA, 6/2007, http://www.ala.org/alcts/resources/preserv/defdigpres0408

"The Open-Source Movement", Warger, Thomas A., Educause Review, 1/1/2002, http://www.educause.edu/ero/article/open-source-movement
