• No results found

Charged! By the Technology Services Council to:

N/A
N/A
Protected

Academic year: 2021

Share "Charged! By the Technology Services Council to:"

Copied!
43
0
0

Loading.... (view fulltext now)

Full text

(1)
(2)

Charged!

By the Technology Services Council to:

Review and provide an analysis of our current DAMS

(CONTENTdm), and other content management systems

Receive input/feedback from users of current and other

DAMS

Compare features, issues, and functionalities of other DAMS

to ours

Proposal:

The potential of segmenting DAMS on the basis of needs of

different types of content

Proposing a single solution that addresses most needs

(3)

Capacity for timely accommodation of

feature requests or fixing known problems

(OAI)

Local customizations get broken/have to be

ported for upgrades

Features appear and disappear (my favorites)

Does not support all our content types at the

same level (EAD)

(4)

Showcases a wide variety of materials:

Subject matter/media

Digital photographs, newspapers, maps, books, audio

and video recordings, and various other formats items.

Over 450 Digital Collections

Over 2.5 million Digital Objects, and growing…

UDN = 1.5 million pages

CDMbuntu = 1 million objects

Highest no. of objects in CDM, anywhere

Formats

jpegs, jpeg 2000s (zoomable files), pdfs, epub files,

(5)

UU-Marriott Library- Information Technology

Services

UU-Marriott Library –Scholarly Resources &

Collections

UU-Marriott Library – Special Collections

UU-Marriott Library - Research and Information

Services

UU-Eccles Health Sciences Library

UU-Quinney Law Library

(6)

First meeting in: January of 2013

Initial deadline by TSC: May 2013

Deadline extended twice

(To accommodate for) Open source

(7)

DAM REVIEW

SOFTWARE

SELECTION Working Group Scope: look at other peer institutions and PAC12 Institutions ANALYZE USER SURVEY Takeaways- Considerations for Stakeholder Analysis- patron needs pertaining to DAMS Criteria STAKEHOLDER ANALYSIS (IR, SPC, FA, UDN, Eccles, Quinney) Identifying stakeholders for

types of content and keeping their needs in mind in relation to building

out the Requirements Criteria TESTING • Sandbox setup • Hardware/Softwar e requirements • Other FINALYZING S/W LIST FOR REVIEW Scope: limit list to

10

Ensuring that final list of platforms gauge well against

Requirements Criteria

REQUIREMENTS CRITERIA (9 Dimensions) Ensuring the criteria

meets needs of all different formats in Digital Library EVALUATION CRITERIA • Scoring • Baseline • Cost-benefit analysis • Resources DEVELOPING BASELINE Scoring current DAMS INFORMATION GATHERING • Contacting Vendors • Contacting Institutions/Clie nts/Users of platforms • Conference Calls • Presentations (in-house and online) • Webinars VENDOR/CLIENT PRESENTATIONS • Online • In-person • Recordings SCORING SESSIONS (of Platforms) Criteria: • Vendor/Client presentations • Baseline DERTERMINING BASELINE Scoring of currents DAMS (CONTENTdm) DETERMINING DELTA • What other significant improvements are required to justify the change? • Costs?

(8)

Stakeholder Analysis

Software Selection

Requirements Criteria

Vendor Questions List

Scoring Model

(9)

UDN

First time visitors

26%

At least monthly

55%

Genealogists

63%

Accuracy good/excellent

64%

Will return soon

77%

More knowledge of fam hist 62%

Found new sources

66%

From outside Utah

43%

Overall good/excellent

80%

Most frequent suggested

improvement:

ADD MORE CONTENT

CDMbuntu

Overall good/excellent

72%

Navigation good/excellent

72%

Site layout good/excellent

64%

Positive feedback on UI

62%

Relevant, accurate searches

62%

Most frequent suggested

improvement:

(10)

CONTENTdm

Rosetta

Equella

Omeka

Cumulus

ChromAm

Bepress

XTF

Hydra

MDID

NWDA

(11)

CONTENTdm

Online presentation by OCLC

Gave it a score

Established that score as our

baseline

Rosetta

In-house presentation by

Exlibris

Rosetta currently not able to

meet the needs of a true

DAM- does not sufficiently

meet our Requirements

(12)

Equella

Teaching and Leaning

Technologies (TLT) on

campus, heavy users

Scored low

Group decided it did not

meet basic criteria

Omeka

Main strengths lie in the

presentation and exhibits

display

Omeka team did not

respond enthusiastically to

Questions list

It did not meet the main

Requirements Criteria

(13)

Canto Cumulus

Large scale deployments of

Cumulus

Customers include Intel,

Bank of America, NASA, etc.

Limited OAI support

No support for Dublin Core

Not suited for Digital

Libraries

Did not meet basic criteria

ChronAm

Demonstration of content in

ChronAm versus content in

UDN

Scored on 2 dimensions

(technical Infrastructure, and

Patron ease-of-use)

Required provision for

support of article level

metadata missing

Potential for becoming

Hydra head for newspapers

Decision to table ChronAm

until further development

along the process

(14)

Bepress

Scored only as an IR

platform

Cadillac solution for IR only

content

Expensive

Decision to table Bepress

until further developments

on the Open Source front

NWDA (North West

Digital Archive)

Scored only as a solution

for EADs

One workflow for adding

and editing items

Works great from the

user’s perspective

SPC using NWDA until

further developments on

the Open Source front

(15)

MDID

XTF

(16)

9 Dimensions

Patron ease-of-use

Types of content

Ingestion/conversion/barriers to exit

Collection administration

Metadata administration

Integration with other library platforms

Technical infrastructure/administration

Support

Future/strategic directions

Full requirements criteria available on webinar page,

http://mwdl.org/events/DAMS_options.php

(17)

End-user experience

Intuitive, easy navigation

User display settings

Zoom, rotate, download, print, sort, no. items per page, other user

customizations

Search accuracy

discoverability of the content

Handle all formats in similar fashion across all browser types,

operating systems, and mobile devices

Size of the patron base

Advanced search options

Selecting collections/content to search

Multiple fields

Boolean operators

Searching within results

UDN: Date searching (by date range, before/after specific date)

(18)

Major content categories

IR

EAD

UDN

Everything else

File types and datasets

ead files, xml, csv, tab-delimited, zip, epub, ppt, url, kmz,

tiff, jpg, jp2, mp3, mp4, pdf, ppt, etc.

Tiers of content (both object and item level)

UDN: 3-tiered data structure - issue, page, article

Streaming

Streaming capability, based on media types, file types, user

connection, device type (including mobile)

Adjustable feature to determine bandwidth, device, etc. for

best possible

(19)

New content

Ability to load batches of data

UDN: Ingest batches in NDNP METS/ALTO format

plus article.xml files

Create derivative images

Run OCR

Convert existing collection data from CDM

Barrier(s) to Exit

(20)

Content access, security

Metadata options

Configure for each collection

Import/export

Support for common metadata standards (Dublin Core, MODS, EAD, METS,

etc.)

Faceting

Editing capability

Batch editing

Find-and-replace

Thumbnail editing

Movement of objects between collections

No character limit on metadata fields

*CDM’s current limit is 132K

Copyright

Ensuring authenticity and integrity of objects

Flexibility with metadata templates to be able to add appropriate

copyright statements

Control over download file size

Full text search capability (OCR)

*UDN: full-text within article-level metadata.

SEO

(21)

Design of interface

User permissions

Ability of collection mgr’s to do appropriate levels of

maintenance

Website configuration

Display template customizations

Client logos, sort options, fields displayed, etc.

Statistics / Reporting / Logs

Reporting capability by collection and item level.

Item level urls should include collection alias and item #s

to allow for filters

Ability to attach Google Analytics into the reporting

structure

Human read-ability of url

(22)

Primo

OAI harvesting

Rosetta

Link display object with archival object using ARK

(23)

Scale-ability of collections/content

Hardware/Server(s) specs - operating system, RAM, data

storage

Virtualization of the server(s)

Multi-threading of processor(s)

Automated/scheduled data repairs

Scheduling "Cron" jobs

API support

Single Signon - interfacing with CAS

Open standardness / non-proprietary

Language and Framework

Backups

Design of interface

System and server installation, configuration, and upgrades

Userid Admin / Permissions

(24)

Training

Training manual

Help Desk

Online, chat, phone

Expected response time

(25)

Web 2.0 / Social

Publishing ability/permissions/ rights management

Comments/tags

Social sharing, tagging, etc.

Meet evolving technologies

Linked data

Crowd sourcing

Capability to transcribe an audio file or and audio

track of a video file

(26)

Tier 1 Scoring

Based on user experience, knowledge, and research

Tier 2 Scoring

Based on in-house and online demos

Process of elimination

Using CDM as baseline

Systems scoring lower than CDM, eliminated from review

process

Scoring model available on webinar page:

http://mwdl.org/events/DAMS_options.php

(27)

Vendor based solutions not scaling to the

needs of our growing digital library

Need to possibly explore a different option?

Open source?

(28)

Two systems under review for Open Source

category:

XTF

Hydra

Looked at the possible implementations of

(29)

Requires a shift in approach for our digital

library

Cultural shift in libraries

Requires dedicated staff in place of vendor

support

Opportunity to develop our own code and

features

Opportunity to engage with community of

(30)

Code managed on Github:

https://github.com/projecthydra

Lead Institutions:

Stanford, University of Virginia, Hull, Notre Dame,

Northwestern, Penn State, and more.

(31)

Open Source Repository System

o

DAM Support

o

Preservation Support

Supported by active community and

Duraspace foundation,

(32)

Case studies, screencasts, and

documentation available at:

http://projecthydra.org/apps-demos-2-2/

(33)

Fedora - Open Source Repository System

o

DAM Support

o

Preservation Support

o

Supported by active community and Duraspace

foundation,

http://www.fedora-commons.org/

Blacklight – discovery interface

Apache Solr – search and indexing

Ruby on Rails – Development language for

(34)

From call with Stanford, "General pattern is an engineer and

analyst/UI developer bringing an institution online in 6-8

months while working on Hydra and other responsibilities"

-Note this reflects a single Hydra project (eg only an institutional

repository)

At Duke University they started with 2 part-time developers who

now both work on Hydra full-time.

At Tufts, they had an existing Fedora repository. For their

environment, switching to Hydra took 1.5 FTE with an additional

1 FTE from an archivist. Maintenance takes less than the 2.5 FTE

allocated for development.

Northwestern development of Hydra-based MDID replacement

took 2 years of .75 FTE programming with additional project

management, user management, and UI support on project

team.

(35)

Marriott Library has formed a Hydra Development

group.

Attended HydraConnect at UCSD recently.

Have begun initial development.

Needs of University of Utah require multiple Hydra

heads displaying multiple types of content.

We estimate 1-2 year

minimum

development time

depending on staff time and resources, and

feature requests/requirements.

(36)

Situation being analysed:

Marriott Library’s

current DAM System (CONTENTdm) and

(37)

Strengths:

For current, locally hosted solution

• Familiarity of system and features (users already

trained on system; 10+ years of technical and administrative experience)

• Ability to customize display layer

• Able to update meta-data in bulk

• Able to integrate with other library/university

applications

For OCLC’s future (hosted) cloud-based solution

• Reduction in technical staff (e.g. server

administrator and DAMS)

• Upgrades will be seamless and automatically

implemented

• Vendor-client relationship gets simpler (roles are

better defined)

• Reduction in effort required to maintain local

customizations

Weaknesses:

For current, locally hosted solution

• Slow and sometimes unable to accommodate

feature requests or fix to problems

• Local customizations break when system is

upgraded

• Features appear/disappear without warning

• Not scaling to our size and according to our

requirements

• Does not support multiple content types equally

(IR, UDN, CDMbuntu)

• EADs remain to be a problem (currently not

supported)

For OCLC’s future (hosted) cloud-based solution

• Local customizations go away completely

• Does not support multiple content types equally

(IR, UDN, CDMbuntu)

• EADs not supported in hosted environment

• Loss of managing meta-data in bulk

(38)

Opportunities:

For current, locally hosted solution

• No significant time investment required in new

training

For OCLC’s future, hosted, cloud-based solution

• Possible redeployment of technical expertise

(System Administrator and Programmer roles (we may still be required to develop functionality using the CDM API) somewhere else, where better needed

• Crowd-sourced metadata editing capability via

World Cat

Threats:

For current, locally hosted solution

• System features/functionalities malfunctioning

due to scalability issues

• Current implementations eventually become

obsolete

• Continuous efforts in system customizations and

upgrades may become a sunk effort/cost

• Will eventually lose OCLC’s product support if

stay on current locally hosted system

• Forced to migrate to vendor’s future cloud based

solution (without clarification or the availability of new technical and other specifications) For OCLC’s future, hosted, cloud-based solution

• Current implementation eventually becomes

obsolete

• Current efforts on local system, customizations

and upgrades will become a sunk effort/cost

• When forced to migrate to OCLC’s new cloud

based solution, it will lead to added dissatisfaction (based on score of hosted solution on the basis of current knowledge of features)

• Lose control and customizations

• Large client needs won’t be readily addressed in

new solution (CDM’s future, cloud-based solution is geared towards small to middle tier Digital Libraries).

• Future directions of vendor not aligned with ML’s

(39)

Strengths:

Demonstrated history of innovation

and collaboration.

Strong partnership model.

High profile institutions have

committed to becoming formal

Hydra partners.

Hydra is built on open source

components.

Articulated governance structure.

Technology framework for Hydra is

robust, provides a platform for

continued innovation.

Development is directed to future

needs of digital library community.

Sustainable cost structure, not at

liberty of increasing licensing costs

Can use recently developed SIMP

tool as platform for Hydra ingestion.

Weaknesses:

Need to develop in house expertise

in Hydra technologies.

Staffing costs – start up and

ongoing.

Migration time to move digital

materials into new repository.

(40)

Opportunities:

Can develop our digital library with

the features we need.

Ability to engage with dynamic

Hydra software development

community.

Can support data curation (regular

data, not “big data”).

Roadmap for linked data support.

Successful records of grant funding

to complement Hydra development

efforts, provides a grant writing

opportunity.

Ability to provide consistent user

experience to digital library users

(no myfavorites disappearing or

breaking).

Can influence and contribute to

future developments of Hydra code

base.

Offer our services to other UALC

members or other state agencies.

Threats:

Loss of partners who don’t want to

move away from CONTENTdm.

Loss of relationship with OCLC.

Have to be our own technical

support.

(41)

Based on in-depth analysis (including system

scores), the committee has proposed:

Embark on the Hydra Development Plan

Table Bepress And ChronAm as IR and Newspaper

solutions until further information from Hydra

development is acquired

Continue using NWDA for EADs until we have

further information from Hydra Development

(42)

Starting Hydra development efforts.

Identifying resources needed, including

(43)

Kinza Masood

kinza.masood@utah.edu

Anna Neatrour

anna.neatrour@utah.edu

References

Related documents

The community spread in Washington resulted in the first death in the United States from COVID-19, as well as the first reported case of COVID-19 in a health care worker, and the

The proposal should include a listing of desired outcomes, a description of activities planned during the sabbatical, a summary of travel plans (in general terms) and a listing

a 4-CPU Linux server with 4 GBytes of memory, the design stage would involve a search through the IT server standards for the best server menu option that offers at least four

Institute of Electrical and Electronics Engineers (IEEE) does not grant permission for this article to be further copied/distributed or hosted elsewhere without the express

It is possible to monitor the biological variations within the mixed prairie ecosystem using remote sensing techniques; (3) certain spectral vegetation indices are better than

The Aguaytia project includes development of Aguaytia gas field and construction and operation of: gas processing and natural gas liquid fractionation facilities; 299 km of gas

The basic data required for this study are the arrival and departure time of the u–turning vehicles, the arrival time of the conflicting vehicles in the main