An Introduction to EAD, EAC-CPF,
and Archival Metadata Standards
7
thInternational Seminar of Archives
from Iberian Tradition,
1 July 2011
Formats for Exchanging Archival
Data
Michael Rush
Accessioning Archivist / EAD
Coordinator,
Beinecke Rare Book and Manuscript
Library, Yale University
Co-Chair, Technical Subcommittee for
Encoded Archival Description, Society
Assumptions
Familiarity with some ICA standards - ISAD
(G), ISAAR (CPF), ISDF, or ISDIAH
Awareness of EAD, but little of no experience
with it
Definitions
EAD: Encoded Archival Description
EAC-CPF: Encoded Archival Context –
Corporate bodies, Persons, and Families
XML: Extensible Markup Language
EAD Development History
Berkeley Finding Aid Project (1993-1995)
EAD Alpha (1996)
EAD Beta (1996)
EAD 1.0 (1998)
EAD 2002 (2002)
EAD 2002 Schema (2007)
EAD 2013?
EAC-CPF Development History
Meeting at Yale University (1998)
Meeting at University of Toronto (2001)
EAC Beta (2004)
Governance and Maintenance
EAD
EAD Working Group (1995-2010)
Technical Subcommittee for EAD (2010- )
EAC-CPF
Ad hoc working group (2001-2004) EAC Working Group (2007-2011)
Technical Subcommittee for EAC-CPF (2011- )
EAD Design Goals
Represent hierarchical structure of finding
aids
SGML, then XML
Flexibility, to encourage adoption.
Compatibility with ISAD (G)
EAD Applications
Delivery
Standardization
Sharing
Transmission/Communication
Repurposing
Example EAD Implementations
Yale Finding Aid Database
Online Archive of California
(OAC)
Northwest Digital Archive
(NWDA)
EAC-CPF Design Goals
Close compatibility with ISAAR (CPF)
A change from EAC Beta to current schema
XML
Philosophical neutrality
Relatively simple and straightforward
Extensible design
EAC-CPF Applications
Identity/Authority
Description
Relationships
Aggregation
Transmission/Communication
Example EAC-CPF Implementation
The Social Networks and Archival Context Project (SNAC)
Challenges
Data migration and/or creation
Establishing encoding best
practices
Delivery
Indexing and search
Display
Data maintenance
Sharing
Data migration/creation
Methods
Hand encoding Templates Scripting Outsourcing Export from databases (Archivists’ Toolkit,
Archon, ICA AtoM)
Costs
Staff time
Staff training
Consultant or outsourcing fees Software
Encoding Best Practices
Local
Yale EAD Encoding Best Practice Guidelines [EAD]
Consortial
Northwest Digital Archives Best Practice Guidelines [EAD]
RLG Best Practice Guidelines for Encoded Archival Description
Delivery
Indexing and search
No single solution
Popular tools include :
➢
XTF (eXtensible Text Framework)
➢
Fedora Commons Repository Software
Display
Transformation via XSLT (Exstensible
Stylesheet Language – Transformations)
➢
XML --> HTML
➢XML --> PDF
Data Maintenance
File management
Version control
Sharing
Consortia
Bulk Aggregators
ArchiveGrid
Topical Aggregators
Related Description Standards
ICA standards:
ISAD(G): General International Standard Archival
Description - Second edition
ISAAR(CPF): International Standard Archival
Authority Record for Corporate Bodies, Persons and Families, 2nd Edition
ISDF: International Standard for Describing
Functions
ISDIAH: International Standard for Describing
Institutions with Archival Holdings
National Description Standards
EAD: Basic Structure
<ead>*
<eadheader>*
<archdesc>*
EAD Header
<eadheader>*
<eadid>*
<filedesc>*
<profiledesc>
<revisiondesc>
File Description
<filedesc>*
<titlestmt>*
➢<titleproper>*
➢<author>
<publicationstmt>
➢<publisher>
Profile Description
<profiledesc>
<creation> -
Creation
<langusage> -
Language Usage
<descrules> -
Descriptive Rules
Revision Description
<revisiondesc>
<change> - Change
➢
<date> - Date
➢<item> - Item
EAD: Basic Structure
<ead>*
<eadheader>*
<archdesc>*
Hierarchical Encoding
<archdesc>
Top level of description.
<dsc>
Optional child of <archdesc>
Components
<c> - Component (Unnumbered)
Or
<c01> - Component (First Level)
<c02> - Component (Second
Level)
➢
Through <c12> - Component
Descriptive Elements
Valid as at all levels of
description
<did> is required at each level
of description.
Descriptive Identification
<did>*
Always the first child of
<archdesc> and the
component elements.
Wrapper element containing
elements with basic identifying
information.
<did> Children
<unitid> - Unit Identification
[ISAD(G) 3.1.1]
<unittitle> - Unit Title
[ISAD(G) 3.1.2]
<did> Children (continued)
<unitdate> - Unit Date
[ISAD(G) 3.1.3]
<physdesc> - Physical
Description
<did> Children (continued)
<origination> - Origination
[ISAD(G) 3.2.1]
<langmaterial> - Language
of the Material [ISAD(G) 3.4.3]
<did> Children (continued)
<note> - Note [ISAD(G) 3.6.1]
<abstract> - Abstract
<physloc> - Physical Location
<materialspec> - Material
Specific Details
<did> Children (continued)
<did>
<container> - Container
<dao> - Digital Archival
Object
<daogroup> - Digital
<did> Siblings
<bioghist> - Biography or History
[ISAD(G) 3.2.2]
<custodhist> - Custodial History
[ISAD(G) 3.2.3]
<acqinfo> - Acquisition Information
[ISAD(G) 3.2.3]
<did> Siblings
<scopecontent> - Scope and
Content
[ISAD(G) 3.3.1]
<accruals> - Accruals
[ISAD(G) 3.3.2]
<appraisal> - Appraisal
[ISAD(G) 3.3.3]
<arrangement> - Arrangement
[ISAD(G) 3.3.4]
<did> Siblings (continued)
<accessrestrict> - Conditions
Governing Access [ISAD(G) 3.4.1]
<userestrict> - Conditions Governing
Use
[ISAD(G) 3.4.2]
<phystech> - Physical Characteristics
and Technical Requirements [ISAD(G)
3.4.4]
<otherfindaid> - Other Finding Aid
[ISAD(G) 3.4.5]
<did> Siblings (continued)
<originalsloc> - Location of Originals
[ISAD(G) 3.5.1]
<altformavail> - Alternative Form
Available
[ISAD(G) 3.5.2]
<relatedmaterial> - Related Material
[ISAD(G) 3.5.3]
<separatedmaterial> - Separated
Material
<did> Siblings (continued)
<bibliography> - Bibliography
[ISAD(G) 3.5.4]
<note> - Note
[ISAD(G) 3.6.1]
<odd> - Other Descriptive Data
[ISAD(G) 3.6.1]
<processinfo> - Processing
Information
<did> Siblings (continued)
<prefercite> - Preferred
Citation
<controlaccess> - Control
Access
<fileplan> - File Plan
<index> - Index
SINGLE IDENTITY: one person (or corporate body or family) with a single identity represented in one EAC-CPF instance. (Most common.)
MULTIPLE IDENTITY-MANY IN ONE: two or more identities (including official identities) with each represented by distinct descriptions within one EAC-CPF instance. Can be programmatically converted into Multiple Identity-One in Many. (Less common though not rare.)
MULTIPLE IDENTITY-ONE IN MANY: two or more identities (including official identities) each represented in two or more interrelated EAC-CPF instances. Can be
programmatically converted into Multiple Identity-Many in One. (Less common though not rare.)
ALTERNATIVE SET: derived EAC-CPF instance that is based on and incorporates two or more alternative EAC-CPF instances for the same entity. To be used by a consortia or a utility providing union access to authority records maintained in two or more
systems by two or more agencies. Alternative EAC-CPF instances may be in different languages or in the same language.
COLLABORATIVE IDENTITY: a single identity shared by two or more persons (e.g. a shared
pseudonym used in creation of a collaborative work). Use Multiple Identity-One in Many. (Rare.)
<eac-cpf>*
<control>*
<cpfDescription>
➢<identity>
➢<description>
➢<relations>
Basic structure
Basic Structure
<control>: identity, creation, maintenance,
status, rules and authorities, and sources
used to generate the EAC-CPF instance.
<cpfDescription>: description of the EAC-CPF
entity
<identity>: names
<description>: formal and informal descriptive elements
<relations>: relationships to other entities, resources and function descriptions
<eac-cpf> <control></control> <cpfDescription> <identity></identity> <description></description> <relations> <cpfRelation></cpfRelation> <cpfRelation></cpfRelation> </relations> </cpfDescription> </eac-cpf>
Philosophical neutrality (1)
Philosophical neutrality (2)
<eac-cpf>
<control></control>
<multipleIdentities>
<cpfDescription></cpfDescription>
<cpfDescription></cpfDescription>
<cpfDescription></cpfDescription>
</multipleIdentities>
</eac-cpf>
<control>
<recordId>* <maintenanceAgency>* <maintenanceStatus>* <maintenanceHistory>* <publicationStatus> <languageDeclaration>* <sources>* <conventionDeclaration> <otherRecordId> <localControl> <localTypeDeclaration><entityType>*
<nameEntry>**
<nameEntryParallel>**
<entityId>
<descriptiveNote>
<cpfDescription>/<identity>
Basic Name Models
<nameEntry>
<part></part>
<useDates></useDates>
</nameEntry>
<nameEntryParallel>
<nameEntry></nameEntry>
<nameEntry></nameEntry>
</nameEntryParallel>
<existDates> <function> <generalContext> <legalStatus> <languageUsed> <mandate> <occupation> <place> <biogHist> <structureOrGenealogy> <localDescription>
<cpfDescription>/<description>
<cpfRelation>
<functionRelation>
<resourceRelation>
◦ @*RelationType ◦ <relationEntry> ◦ <objectBinWrap> ◦ <objectXMLWrap>◦ <date>, <dateRange>, <dateSet> ◦ <place>
◦ <descriptiveNote>
@cpfRelationType
identity hierarchical hierarchical-parent hierarchical-child temporal temporal-earlier temporal-later family associative@resourceRelationType
creatorOf subjectOf other
@functionRelationType
controls owns
performs
EAD Revision Timeline
Comment period complete (October 2010 –
February 2011)
EAD Revision Forum (SAA Annual Meeting,
August 2011)
TS-EAD Working Meeting (March 2012)
Release draft schema (Fall 2012)
Second comment period (Winter 2013)
Finalize schema and documentation (Spring
2013)
EAD Revision Goals
Clarify relationship with EAC-CPF
Improve interoperability with databases
Reconsider finding aids as documents or data
Simplification
To eliminate unnecessary complexity To make implementation easier
Improve usability
Enable profiles (schema subsets)
Data-friendly
Implementation-friendly (may or may not be
Future EAC Development
EAC-CPF Implementation
Review by 2016
Companion EAC standards?
EAC-Functions (EAC-F)?