• No results found

AIIM ECM Certificate programme

N/A
N/A
Protected

Academic year: 2021

Share "AIIM ECM Certificate programme"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

Taxonomy

Enterprise Content

Management (ECM)

ECM ECM ECM Case Study

AIIM ECM Certificate programme

ECM Strategy ECM Practitioner ECM Specialist Case Study 2

(2)

ECM Practitioner Course Outline

Foundations Tools & Instruments

1. Introduction 2. Technologies & Functionality 4. Create & Capture 5. Metadata 7. Security & Control 10. Delivery & Presentation 8. Process & Automation 11. Trends & Directions Futures 3 3. Information Architecture 9. Findability 6. Taxonomy

© AIIM | All rights reserved 3

Agenda

ƒ

Defining taxonomies and classification

ƒ

Subject-based classification

ƒTaxonomies

ƒFolksonomies

ƒOntologies

ƒThesaurus and Semantic networks

ƒ

Business case for classification

4

ƒ

Business case for classification

ƒ

Standards and guidelines

ƒ

Classification challenges

(3)

Agenda

ƒ

Defining taxonomies and classification

ƒ

Subject-based classification

ƒTaxonomies

ƒFolksonomies ƒOntologies

ƒThesaurus and Semantic networks

ƒ

Business case for classification

5

ƒ

Business case for classification

ƒ

Standards and guidelines

ƒ

Classification challenges

© AIIM | All rights reserved

Defining taxonomy (1)

ƒ

Taxonomy is the science of classifying information

ƒ

A taxonomy is a law for classifying information

ƒ

Taxonomies are nearly ubiquitous, but poorly

understood

6

Source: Dictionary.com

(4)

Defining taxonomy (2)

ƒ

“In recent years, the business world has fallen in love

with the term ‘taxonomies’. We use it specifically to refer

with the term taxonomies . We use it specifically to refer

to a hierarchical arrangement of categories within the

user interface of a website or intranet.”

7

Source: Information Architecture for the World Wide Web (Louis Rosenfeld and Peter Morville, 2002)

© AIIM | All rights reserved

AIIM website

1 2 4 3 8 © AIIM | All rights reserved
(5)

Understanding taxonomies

ƒ

A taxonomy is a classification scheme

ƒSuch as the way that an individual classifies the content of their e mail

ƒSuch as the way that an individual classifies the content of their e-mail inbox, a personal CD collection, or the contents on an iPod

ƒ

A taxonomy is a knowledge map

ƒReflects how it’s owner conceives a given body of content (a knowledge domain), for purposes of browsing, navigating, discovering, and sharing that information

9

ƒ

A taxonomy is semantic

ƒIndicating the relationships between concepts, such as the relationships between a car and a steering wheel, in that the steering wheel is a “part of” a car

Source: Organising Knowledge (Patrick Lambe, 2007)

© AIIM | All rights reserved

Category perspectives

ƒ

Business function

ƒ

Geo-political

ƒ

Company focus vs. industry focus

ƒ

Product or service

ƒ

Business issues, conditions, events

10

ƒ

Type/Source of content

(6)

Representations of taxonomies (1)

ƒ

Lists

ƒ

Trees

ƒ

Hierarchies

ƒ

Polyhierarchies

ƒ

Matrices

11

ƒ

Facets

ƒ

System Maps

© AIIM | All rights reserved

Source: Organising Knowledge (Patrick Lambe, 2007)

Representations of taxonomies (2)

ƒ

Lists

ƒSimple collection of related

ƒSimple collection of related things. The relationship is defined by the purpose of the list.

ƒGood when domain is simple, amount of content is small. Basic building blocks of all other taxonomical representations

ƒExamples: Country codes types

12

ƒExamples: Country codes, types of diseases

Source: Organising Knowledge (Patrick Lambe, 2007) Source: Wikipedia

(7)

Representations of taxonomies (3)

ƒ

Trees

ƒRepresents a transition from

ƒRepresents a transition from general to more specific relationships or whole to part.

ƒGood when a list gets to be too long, and “naturally” breaks into subcategories.

ƒExamples: Yellow pages (phone directories)

13

directories)

Source: Organising Knowledge (Patrick Lambe, 2007) Source: CoreFiling.com

© AIIM | All rights reserved

Representations of taxonomies (4)

ƒ

Hierarchies

ƒA specific tree structure that has

ƒA specific tree structure that has inclusiveness, consistency, and maintains the same “type” of relationship at each level. The “child” inherits all of the

characteristics of the “parent” and each child can only belong in one place in the taxonomy

14

ƒWorks best with mature, formal, logical schemes

ƒExamples: Military rank, Biological, Family Genealogy

Source: Organising Knowledge (Patrick Lambe, 2007)

(8)

Representations of taxonomies (5)

ƒ

Polyhierarchies

ƒUsed when an item belongs in

ƒUsed when an item belongs in more than one place in the real world, and multiple organising principles are required. Provides “virtual linking” between

hierarchies.

ƒExample: a single collection of content concerning diseases can

15

content concerning diseases can be organised/taxonomised via affected body part and causes

Source: Organising Knowledge (Patrick Lambe, 2007) Source: Rosenfeld, Morville (2006)

© AIIM | All rights reserved

Representations of taxonomies (6)

ƒ

Matrices

ƒProvides a 2 or 3 dimensional

ƒProvides a 2 or 3-dimensional cross linking of taxonomies, and an ability to provide differing views into the same body of content.

ƒExample: The same content could be located based on project manager, project initiation, and/or

16

manager, project initiation, and/or affected standards

Source: Organising Knowledge (Patrick Lambe, 2007)

(9)

Representations of taxonomies (6)

ƒ

Facets

ƒA multi dimensional taxonomy

ƒA multi-dimensional taxonomy comprised of multiple tags, each tag representing an individual taxonomy, thus the content is categorised in multiple ways, within a single interface.

ƒExample: selecting wines based on characteristics such as type,

17

on characteristics such as type, price, varietals, regions,

appellations, and price.

Source: Organising Knowledge (Patrick Lambe, 2007) Source: wine.com

© AIIM | All rights reserved

Representations of taxonomies (7)

ƒ

System maps

ƒVisual representations of a

ƒVisual representations of a domain of knowledge

ƒLabelled representing taxonomy categories

ƒExample: A collection of medical content relating to the human nervous system is accessible via a diagram of the human body

18

a diagram of the human body. Each component of that system is illustrated in context, and labelled appropriately.

Source: Organising Knowledge (Patrick Lambe, 2007)

(10)

Defining classification

Classification:

“The systematic identification and arrangement of

business activities and/or records into categories

according to logically structured conventions,

methods and procedural rules represented in a

classification system”

19

Source: ISO 15489

© AIIM | All rights reserved

What is classification?

ƒ

In simple terms, it’s just grouping information together

ƒ

Common examples of classification:

ƒCars

ƒby make, model, performance

ƒFood

ƒtinned/fresh, type (meat, vegetable, grain)

ƒTV programmes

20

p g

ƒcomedy, thriller, quiz show

ƒClothes

ƒadult/child, expensive/cheap, winter/summer

(11)

Dewey Decimal system

ƒ

Used to classify information

throughout the western

Dewey Decimal system

000 General & Bibliography

throughout the western

world

ƒ

Very Euro-centric

000 General & Bibliography

100 Philosophy & Psychology

200 Religion

300 Social Science

400 Languages & Linguistics

500 Sciences

21 © AIIM | All rights reserved

600 Technology

800 Literature

900 Geography & History

Chinese library classification

ƒ

43,600 categories. Constantly expanding to meet the

needs of a rapidly changing nation

1) Marxism, Leninism, Maoism & Deng Xiaoping Theory 2) Philosophy and Religion 3) Social Sciences 4) Politics and Law 5) Military Science 6) Economics

12) Natural Science

13) Mathematics, Physics and Chemistry

14) Astronomy and Geoscience 15) Life Sciences

16) Medicine and Health Sciences 17) Agricultural Sciences

ƒ

Political considerations drive some organisation

22

6) Economics

7) Culture, Science, Education, and Sports

8) Languages and Linguistics 9) Literature

10) Art

11) History and Geography

17) Agricultural Sciences 18) Industrial Technology 19) Transportation

20) Aviation and Aerospace 21) Environmental Science

(12)

US Library of Congress

ƒ

Used to categorise books published in the United States

ƒ

Expanded categories emphasise USA-specific history

and interests

A) General Works

B) Philosophy, Psychology, Religion C) History: Auxiliary Sciences D) History: General and Old World E) History: United States

F) History: Western Hemisphere

L) Music M) Fine Arts

N) Literature & Languages O) Science P) Medicine Q) Agriculture 23 G) Geography, Anthropology, Recreation H) Social Science I) Political Science J) Law K) Education R) Technology S) Military Science T) Naval Science

U) Bibliography & Library Science

© AIIM | All rights reserved

What are classification schemes?

ƒ

A classification scheme…

ƒIs the structure an organisation uses for organising

ƒIs the structure an organisation uses for organising, accessing/retrieving, storing and managing its information

ƒCan be used to classify records

ƒ

A Business Classification Scheme (BCS) is a

classification scheme based on an organisation’s

business functions and activities

24

ƒThese are predominately used for Records Management purposes

(13)

Classification schemes: Types

Keyword /

Deployment

P

Hierarchical / thesaurus-based Functional Subject / thematic

P

rinciples of

clas

s

Generally preferred tree style 25

s

ification

Organisational

© AIIM | All rights reserved

Hierarchical / tree style BCSs: Key

ƒ

CLASS

C

ƒ

FILE

ƒ

RECORD

F R 26

ƒ

DOCUMENT

D
(14)

Schematic example: Hierarchical / tree BCS

C C C C C C C C C C C C C C C C C C 27 F F F F F F F F

© AIIM | All rights reserved

Populated example: Hierarchical / tree BCS

ƒInnovation, Knowledge Transfer and Technical Infrastructure (super ( p function)

ƒInnovation (function)

ƒKnowledge Transfer (function)

ƒTechnical Infrastructure (function)

ƒStandards and Accreditation (sub

function)

ƒ Policy Management (activity)

Super Function

Function Function Sub

Function FunctionSub FunctionSub

28

Activity Activity Activity Activity

ƒ Infrastructure Support (activity)

ƒNational Measurement System

(sub function)

ƒ Policy Management (activity)

ƒCivil Space Activity (sub function)

ƒ Space Regulation (activity)

(15)

Agenda

ƒ

Defining taxonomies and classification

ƒ

Subject-based classification

ƒTaxonomies

ƒFolksonomies

ƒOntologies

ƒThesaurus and Semantic networks

ƒ

Business case for classification

29

ƒ

Business case for classification

ƒ

Standards and guidelines

ƒ

Classification challenges

© AIIM | All rights reserved

Toward subject-based classification

ƒ

It’s often valuable to create multiple classifications

ƒ Users: Intended audience

ƒ Content: Inherent subject matter

ƒ Context: Temporal, organisational or political drivers

ƒ

User-understood terms are critical

ƒEspecially important for e-commerce

ƒPeople search Google for “cheap flights” 75x th “l f ” (S G M G )

30

Source: Louis Rosenfeld LLC

more than “low fares” (Source: Gerry McGovern)

ƒWho are the users? Scientists? Consumers?

ƒ

Context matters

ƒWhy this user with this content?

(16)

Taxonomies in context

31 © AIIM | All rights reserved

Source: Yahoo!

Hierarchies as implicit semantics

ƒ

Divides information space into categories &

subcategories, relating broader & narrower concepts via

subcategories, relating broader & narrower concepts via

parent-child relationship

ƒGeneric = Class-species: Species B (crow) is a member of Class A (Bird) & inherits characteristics of its parent}

ƒWhole-Part = B is a part of A (i.e., Index Finger is part of Hand)

ƒInstance = B is an instance of A (i.e., Indian Ocean is an Ocean)

32

A A

B B

(17)

Differing views

ƒ

Simple truth: People see (and label!) the world

differently…

Sand trap, or bunker?

33

Sand trap, or bunker?

© AIIM | All rights reserved

Personal taxonomy

• Personal classification of information • E-mail folders -- most common

manifestation manifestation

• Can improve relevance and findability to an individual

– Some approaches enable personal classification in addition to “authorised” taxonomy

– Gmail and some other systems employ faceted classification as well

• From enterprise perspective, personal taxonomies can be quite problematic

No interoperabilit ling istic chaos

34

– No interoperability, linguistic chaos – Impossible to establish enterprise-wide

standards and vocabularies

• When combined with peers, can become a “folksonomy”

(18)

Folksonomy

ƒ

Collaborative tagging of content with minimal controls

ƒ

Relevance between metadata and content may be

determined by users in a democratic fashion

ƒ

Clusters emerge and communities typically self-organise

around them (“Wisdom of the crowd”)

ƒ

Typically arise in Web-based communities where

i di id

l

h

t

t th

t

d

t

35

individuals share content, then create and use tags

ƒ

Best used when there is a critical mass of taggers

ƒCan be a useful “bottom-up” approach to developing taxonomies

© AIIM | All rights reserved

Folksonomy example

36

Source: flickr.com

(19)

What is an ontology?

ƒ

Explicit specification or conceptualisation of a domain

ƒOften subsume thesauri, but employ richer semantic relationshipsOften subsume thesauri, but employ richer semantic relationships

among terms and attributes

ƒApply rigid rules specifying terms and relationships

ƒDo more than just control vocabulary; are a knowledge representation

ƒ

Semantic technologies are typically centered around

ontologies

ƒ

An ontology for salad would contain the structure for

37

ƒ

An ontology for salad would contain the structure for

how it relates to everything, from ingredients to growers

to the rodents that might eat it, and how a salad is

different in Japan vs. Italy

© AIIM | All rights reserved

Why develop an ontology?

ƒ

To improve knowledge sharing and reuse, and make

software more adaptable to an environment

software more adaptable to an environment

ƒShare common understanding of the structure of information among people or software agents

ƒEnable reuse of domain knowledge

ƒMake domain assumptions explicit

ƒSeparate domain knowledge from operational knowledge

38

p g

ƒAnalyse domain knowledge

Source: http://www.alphaworks.ibm.com/contentnr/introsemantics

(20)

The challenge of meaning

ƒMeaning is a hard problem for machines and humans alike ƒSame term can have multiple meaningsSame term can have multiple meanings

ƒMultiple terms can have the same meaning

ƒUltimately meaning is contextual

ƒDublin Core designed to disambiguate at a fundamental level

ƒE.g., distinguishes definitively among “Creator” and “Contributor,” and

“Publisher”

39

ƒBut in the wild, it is much harder to achieve semantic agreement

© AIIM | All rights reserved

Controlled vocabularies

ƒ

Supporting tools based on collections of terms used to

tag, track and describe content

tag, track and describe content

ƒ

For example, users may wish to organise content

according to

ƒbusiness sector ƒgeographical location ƒproduct type 40 ƒorganisation type ƒpolicy topic

ƒ

Allow content to be described using only 'official terms'

(21)

Controlled vocabularies: Types

ƒ

Simple lists

ƒLists of terms allowed to be used to describe an information resource

ƒLists of terms allowed to be used to describe an information resource

ƒ

Synonym rings

ƒA 'ring' of connected terms, all treated as equivalent for searching

ƒSynonym rings can be used to link acronyms, variant spellings or scientific / popular terms

ƒ

Thesaurus

41

Thesaurus

ƒHierarchical arrangement of broader and narrower meanings

© AIIM | All rights reserved

Simple lists and synonym rings

ƒ

Simple list of bovine diseases

ƒAnaplasmosis

ƒAnaplasmosis

ƒBabesiosis

ƒBovine spongiform encephalopathy (BSE)

ƒCysticercosis

ƒ

Synonym ring for a BSE

BSE Mad cows’ disease Bovine spongiform encephalopathy 42 Prion disease

(22)

Thesaurus

ƒ

A networked collection of controlled vocabulary terms,

using associative relationships

ƒUsed to manage and identify the relationships among and between terms

ƒE.g. Equal to, Related to, Opposite of

ƒ

Some examples from a hypothetical domain

ƒLettuce = Frisée (a.k.a, ‘a synonym ring’)

ƒLettuce is a narrower type of Greens

43

ƒCoriander is related to Cilantro; but they are not equal

ƒ

Useful to reconcile different lexicons across business

units or functional groups

© AIIM | All rights reserved

Sample thesaurus

44 © AIIM | All rights reserved

(23)

Ontologies and taxonomies and thesauri

ƒ

How does this relate to Taxonomies and Thesauri?

ƒ“We have all agreed to call this thing lettuce Lettuce is a vegetable ”

ƒ We have all agreed to call this thing lettuce. Lettuce is a vegetable.

ƒ

There is a much larger potential pool of semantic

information that a taxonomy may or may not contain:

ƒ“Lettuce grows in the ground. Rabbits are a hazard to lettuce growers.

Tomatoes and cucumbers are often eaten with lettuce, and the three of these things together make what is called a salad. But, a salad is not only defined by the collection of these three things in Japan a mixture

45

only defined by the collection of these three things…in Japan, a mixture of seaweed and sesame seeds is a salad. In the Midwestern United States, a collection of Jell-O and radishes is called a salad, and there is no lettuce involved.”

© AIIM | All rights reserved

Agenda

ƒ

Defining taxonomies and classification

ƒ

Subject-based classification

ƒTaxonomies ƒFolksonomies ƒOntologies

ƒThesaurus and Semantic networks

ƒ

Business case for classification

46

ƒ

Business case for classification

ƒ

Standards and guidelines

ƒ

Classification challenges

(24)

Benefits of classifying records (1)

ƒ

Providing linkages between individual records which

accumulate to provide a continuous record of activity

p

y

ƒ

Ensuring records are named in a consistent manner over

time

ƒ

Assisting in the retrieval of all records relating to a

particular function or activity

ƒ

Determining security protection and access appropriate

47

g

y p

pp

p

for sets of records

ƒ

Allocating user permissions for access to, or action on,

particular groups of records

© AIIM | All rights reserved

Benefits of classifying records (2)

ƒ

Distributing responsibility for management of particular

sets of records

sets of records

ƒ

Distributing records for action

ƒ

Determining appropriate retention periods and

disposition actions for records

48 © AIIM | All rights reserved

(25)

Agenda

ƒ

Defining taxonomies and classification

ƒ

Subject-based classification

ƒTaxonomies ƒFolksonomies ƒOntologies

ƒThesaurus and Semantic networks

ƒ

Business case for classification

49

ƒ

Business case for classification

ƒ

Standards and guidelines

ƒ

Classification challenges

© AIIM | All rights reserved

Standards and guidelines

ƒ

ISO 15489 - the international standard for records

management

g

ƒ

MoReq2 - the Model Requirements for the Management

Of Electronic Records

ƒ

DIRKS - the Design and Implementation of

Record-Keeping Systems methodology

ƒ

ISO 2788 - Guidelines for the Establishment of

50

Monolingual Thesauri

(26)

Agenda

ƒ

Defining taxonomies and classification

ƒ

Subject-based classification

ƒTaxonomies ƒFolksonomies ƒOntologies

ƒThesaurus and Semantic networks

ƒ

Business case for classification

51

ƒ

Business case for classification

ƒ

Standards and guidelines

ƒ

Classification challenges

© AIIM | All rights reserved

Classification challenges (1)

ƒ

Laborious and difficult to develop

ƒTendency to over analyse

ƒTendency to over-analyse

ƒ

May need more than one

ƒ

Classification of content into a

categorisation scheme is ongoing

work

52 © AIIM | All rights reserved

(27)

Classification challenges (2)

ƒ

Categories need ongoing care and feeding

(including thesauri, taxonomies, controlled vocabularies

(including thesauri, taxonomies, controlled vocabularies

and ontologies)

ƒContent changes, Context changes

ƒVocabularies change

ƒExperience may breed new perspectives

53 © AIIM | All rights reserved

What you have learned

ƒ

How to leverage classification in general and taxonomies

in particular as part of an ECM strategy

in particular as part of an ECM strategy

ƒ

Different approaches to subject-based organisation

schemes:

ƒTaxonomies ƒThesauri ƒSemantic networks 54 ƒOntologies ƒFolksonomies

ƒ

Managing classification challenges

References

Related documents

• agree, in consideration of Vanderbilt allowing my child to participate in SAVY, to hold harmless and indemnify Vanderbilt and its trustees, agents, officers, servants, and

The middle figure shows the result when the starting model was a fully locked fault and in the bottom figure the starting model was a fully free-slip fault. This was done to see

We explore the effect of not using Facebook for one week on five outcomes: social media usage, news consumption, news awareness, subjective well-being, daily activities and the value

Estimates suggest that (i) resource abundance increases inequality after controlling for per-capita income, (ii) higher inequality reduces the average level of human development in

The National Institutes of Health’s renewal of the five-year grant to support the Atlanta Clinical & Translational Science Institute (ACTSI) will enable Emory University,

The World Wide Web (also known as the internet) has introduced a new dimension to legal research. Unlike before when researchers buried their heads in the labyrinth

We then learned about the Laplace transform, which is a useful tool for solving differential equations and for doing system analysis on continuous-time systems.. Our development

In simple aqua-planet configuration (CMIP5) : large inter-model differences in the position of tropical convergence zones (present-day climate & response to +4K)!. We have