• No results found

MANAGING DIGITAL CONTENT OVER TIME

N/A
N/A
Protected

Academic year: 2021

Share "MANAGING DIGITAL CONTENT OVER TIME"

Copied!
19
0
0

Loading.... (view fulltext now)

Full text

(1)

M A N A G I N G D I G I TA L C O N T E N T O V E R T I M E

P a r t 3 . S t o r e

(2)

M O D U L E S

Identifyde t y ‐ whatat d g ta co te t do you a e digital content do you have? 

Select ‐ what portion of that content will be preserved? Store ‐ what issues are there for long term storage? g gProtect ‐ what steps are needed to protect your digital 

content? 

Manage ‐ what provisions are needed for longterm 

management? 

P id   h   id i     h  f  l  

Provide ‐ what considerations are there for longterm 

(3)

O U T L I N E O F “ P R O T E C T ” D I S C U S S I O N

A. StorageSto age eeds needs

B. Well‐managed collections C. Objectj ‐level metadata

D. Storage considerations

E. Preservation repositoryp y selection F. Outcomes

(4)

A . S T O R A G E N E E D S

Think o you objects as pac ages o dataof your objects as packages of data

 Archival storage manages multi‐part content as single objects

(5)

A . S T O R A G E N E E D S

Think o you objects as pac ages o dataof your objects as packages of data

 Requires some identification and description

 Metadata

(6)

B . W E L L‐M A N A G E D C O L L E C T I O N S

SampleSa p e c a acte st cs characteristics:

 Basic information about each deposit

Administrative metadata

Some metadata for objects (you define)Descriptive metadata

 Common (or normalized) file formats

 Common (or normalized) file formats

But keep those originals

Controlled and known storage of contentWhere is this stuff going?

Multiple copies in at least 2 locationsWhere else is this stuff going?

(7)

B . W E L L‐M A N A G E D C O L L E C T I O N S

Basic information about each deposit

“Easy” preservation metadata scheme: 

h /di / d h l

www.ncecho.org/dig/pmdo.shtml

Some metadata for objects

Descriptive metadata

 Common (or normalized) file formats

TIFF, JPG, PDF

Free imageg  editor: www.irfanview.comBut keep those originals

Controlled and known storage of 

content

Where is this stuff going?

Multiple copies in at least 2 locations

Where else is this stuff going?Where else is this stuff going?

(8)

C . O B J E C T‐L E V E L M E T A D A T A

Metadataetadata e ab es o g te enables longterm preservationp ese at o

 uniquely identifies digital objects

 makes digital objects understandable

(9)

C . O B J E C T‐L E V E L M E TA D ATA

P

i

 

M

d

Preservation

 

Metadata

Content (what), Fixity (unchanged), Provenance (life story), 

Reference (this thing)  Context (relationships) Reference (this thing), Context (relationships)

Administrative (manage) Structural (understand  use) Descriptive (find, use) (understand, use)

(10)

C . O B J E C T‐L E V E L M E TA D ATA

1. Content:Co te t p ese e t e substa ce preserve the substance

 Save the original file, even if you migrate

2. Fixity: demonstrate content is unchanged 

Checksums: http://www.nirsoft.net/utils/hash_my_files.html 3. Format validation: ensure that it is what it purports to be

Jhove: hul.harvard.edu/jhove/windows_xp.html

(11)

C . O B J E C T‐L E V E L M E TA D ATA

4. Authenticity: trace to its origin, deposit, and/or your  4 ut e t c ty t ace to ts o g , depos t, a d/o you

management actions

 Metadata: Date created, date changed, responsible party

5. Context: preserve linkages with other objects   Metadata: Unique identifiers

(12)

D . S T O R A G E C O N S I D E R A T I O N S

Redundancy: How many copies?

Minimum: two (2) copies in two locationOptimum: six (6) copies

Other storage questions

 Are your files too large to store 6 copies? [Videos]  Online, nearline, offline?

Any legal restrictions? [offsite locations]

On what types of media to store your content?

a. Mirrored, networked servers b. Networked server

c. Your computer and another d. External hard drives

(13)

D . S T O R A G E C O N S I D E R AT I O N S

Factors

 

to

 

consider

Factors

 

to

 

consider

Cost (available resources for preservation)Quantity (size and number of files)

k ll d

Expertise (skills required to manage)

Partners (achieving geographic distribution) Services (outsourcing)

(14)

D . S T O R A G E C O N S I D E R A T I O N S  Multiple  geographically Multiple, geographically  distributed copies  Storageg p partners Hosted services

(15)

E . P R E S E R VAT I O N R E P O S I T O R Y S E L E C T I O N

Questions to ask when deciding to use (build, join, buy) a 

repository

 Is the repository best suited to general or specialized content?

Do you want an open source or proprietary system?Do you want an open source or proprietary system?

How easy is it to manage? [Installation, update, batch upload, 

etc.]

 Dark or open archive?

Each option has pros and cons

No system is fully compliant to standards Select best option for your content – for now

(16)

F. O R G A N I Z AT I O N A L R E Q U I R E M E N T S

Digital

 

preservation

 

requires

 

an

 

organization

 

to:

Develop a storage management policy

E.g., number of copies, locations, fixity means

Specify storage service or partner agreementsSpecify storage service or partner agreements

Will you give back a fully functioning file in 50 years, or 

only promise to manageg  the bits?

Monitor copies of content for errors/changePlan for media replacement

Consider file obsolescence and how you’ll manage it

(17)

F. O R G A N I Z A T I O N A L R E Q U I R E M E N T S

1.

Develop

 

a

 

storage

 

management

 

policy

1.

Develop

 

a

 

storage

 

management

 

policy

E.g.,

 

number

 

of

 

copies,

 

locations,

 

fixity

 

means

2.

Specify

p

y

 

storage

g

 

service

 

or

 

partner

p

 

agreements

g

Will

 

you

 

give

 

back

 

a

 

fully

 

functioning

 

file

 

in

 

50

 

years,

 

or

 

only

 

promise

 

to

 

manage

 

the

 

bits?

(18)

F. O R G A N I Z A T I O N A L R E Q U I R E M E N T S

3.

Monitor

 

copies

 

of

 

content

 

for

 

errors/change

3.

Monitor

 

copies

 

of

 

content

 

for

 

errors/change

FITS: http://code.google.com/p/fits/LOCKSS, MetaArchive

4.

Plan

 

for

 

media

 

replacement

 

($$)

5.

Consider

 

file

 

obsolescence

 

and

 

how

 

you’ll

 

manage

 

it

(19)

T O O L S

DIGITALG  PRESERVATIONS O  POLICYO C  TOOLOO  – erpae pa gu da ce guidance http://www.erpanet.org/guidance/docs/ERPANETPolicyTool.pdf

HASHMYFILES – Checksum creator

htt // i ft t/ til /h h fil ht

http://www.nirsoft.net/utils/hash_my_files.htm

JHOVE – Format validation and identification 

http://hul harvard edu/jhove/windows xp html http://hul.harvard.edu/jhove/windows_xp.html

IRFANVIEW ‐ Free image editor (normalization) 

http://www.irfanview.comp //

NCPMDO ‐ “Easy” Preservation Metadata Element Set 

References

Related documents

42 Figure 11 Graph depicting mean Soil Plant Analytical Development (SPAD) values for nine treatments (inoculation type x salinity level) on Caprice snap beans during weeks

graph models (ERGMs), which have great potential for modelling animal social

Key fndings of this work include: (1) a sub–greenschist facies N-S to NE-SW contractional event preserved in the Salinian block adjacent to the fault associated with

“The means used for implementing monitoring and evaluating the communication plan shall be proportional to the information and publicity measures identified in the

Consider using this LKM if your source data located on a generic database is large, and when your staging area is a Sybase Adaptive Server Enterprise database. Load LKM Sybase ASE

CA Multi-family Affordable Housing (MASH) Virtual Net Metering (VNM) 1 MW Included within the state's NEM cap - 5% of utility's Aggregate Customer Peak Demand

hard to compute accurately for the multivariate von Mises distribution, and it is not appropriate to compare distributions from different families. Here, we propose to use

Spaces of homogeneous type, orthonormal basis, test function space, distributions, Calder´ on reproducing formula, wavelet expansion, product Hardy space, Carleson measure space,