Long-term data integrity through
controlled data conversion
Agenda
! Content Addressed Storage (CAS) ! CAS for long-term data integrity ! Example: www.staatsbladclip.be
Long-term data integrity
! content integrity ! time integrity ! viewing integrity
Content Addressed Storage (CAS)
action:store
object:clip
Content Addressed Storage (CAS)
action:retrieve
objects
Content Addressed Storage: clip
F9GHC1KFFGH3Le2L9E7VLIV5TNJ hash value h1 hash value h2 hash value h3 metadata h1, h2, h3 hash valueClip
! Clip = hash => data integrity
! Clip is content address => flat addressing space
! CAS only uses content addressing => retrieve only via clip ! Possession of clip = access to data
! Clip = hash over 1 or more files
! Clip is independent of format of data ! Clip is independent of size of data ! Clip is portable
! Clip can contain metadata => metadata integrity ! Data integrity user authentication
From Clip to TrustClip
! Clip + Service Level Agreement = TrustClip ! Which service ?
! Archival (with/out replication)
! Timestamping
CAS for longterm data integrity
-Timestamping
! TSA model
! How? include TS into metadata
Internet
Internet
1 TrustClip
Timestamped Clip
Timestamp (t1) metadata h1 h1 hash value hash value H1(t1)Chained Timestamped Clip
Timestamp (t2) metadata h1 h1 hash value hash value H2(t2) H1(t1)CAS for long-term data integrity –
Controlled Conversion
! T1 (creation time of Clip): include in Clip metadata:
! viewing application (use a clip) ! possibly OS clip
! T2: check if new viewers available ! T2: if yes => conversion
Conversion Controlled Clip
Timestamp (t1) metadata
h1 h1 hash value
Viewing Application X Clip (t1) hash value
H1(t1)
Timestamp (t2) metadata
h1 h1 hash value
Viewing Application Y Clip hash value
H2(t2)
History CAS technology
! CAS = Content Addressed Storage
! Sold exclusively by EMC as Centera product line (2002) ! HyperTrust = first service provider worldwide to integrate
Centera for generalpurpose “Unified Internet Storage” -! see www.send2store.com
Your Content in Action
! HyperTrust = Internet Storage service provider ! any kind of data – any size, any format
! electronic documents (books, contracts, letters, e-mail attachments, CAD/CAM drawings)
! digital records (check images, photographs, or geophysical information - seismic or astronomical research)
Extending legacy application
! (Trust)Clips can travel within the application, but also
between applications, whether within 1 company or between several
! this is also valid for entire legacy application chains; in
this way, reference information is carried along the chain with minimal effort.
! no real limit to the amount of content that can be
Referencing beyond electrons
! as the Clip itself is “small” and strictly text, it is possible
to use other than electronic media (!) to affix electronic reference information to physical objects:
! printed labels
! barcodes (e.g., label with barcode of user manual on back side of appliance)
! OCR (e.g. electronic addenda to magazine articles) ! TrustClip references within printed contracts, e-mails,
Send2Store: CAS as an internet
service
! CAS = first utility-grade universal storage medium
=> well suited to ASP-like delivery
! Application areas
! store and forward ! sharing
! enriching ! referencing ! integrity
Internet
Internet
• E-mail or web upload with attachment(s)
(2) 1 TrustClip
Send2Store
Demos
! www.send2store.com (free 100Mbyte CAS storage) ! www.staatsbladclip.be