• No results found

Measuring Data Quality Using ISO 8000

N/A
N/A
Protected

Academic year: 2021

Share "Measuring Data Quality Using ISO 8000"

Copied!
17
0
0

Loading.... (view fulltext now)

Full text

(1)

Slide 1

Measuring Data Quality Using

ISO 8000

Peter R. Benson

Executive Director

ECCMA

Project leader for ISO 8000 and ISO 22745

International Association of Data Quality Managers

(2)

When you order seafood from

Quality Fresh Seafood, you can be confident that you are receiving the very best quality of seafood and delivery.

(3)

Slide 3

3.1.1 quality

degree to which a set of inherent characteristics fulfils requirements

3.1.2

requirement

need or expectation that is stated, generally implied or obligatory

ISO 9000:2005(E)

ISO 9000 Definition of “Quality”

Quality data is data that meets stated requirements

(4)

ISO 8000

General Principles Part 1 Introduction Part 2 Terminology Master Data Part 100 introduction Part 110

Syntax Semantic encoding requirements Meets

Part 120 Provenance Part 130 Accuracy Part 140 Completeness Transaction data

The International Standard for Data Quality

(5)

Slide 5

ISO 8000 Quality Data Summarized

Data that meets requirements

(6)
(7)

Slide 7 Terminology Terminology Terminology GAO-05-734SP Budget Glossary MISMO SEC EDM council Terminology

• Public domain concept identifiers

• Free identifier resolution to underlying terminology (web services)‏

• Hyperlink to source standards • Multilingual

• Multiple terms, definitions and images linked to single concept identifier

(8)

ISO 22745 - ECCMA Open Technical Dictionary (eOTD)

The eOTD (ECCMA Open Technical Dictionary) is an ISO 22745-20 compliant central registry of terminology. Each concept and terminological component in the eOTD is

assigned a unique and permanent public domain identifier.

Users create corporate preferred subsets of the eOTD and use the eOTD concept

identifiers to manage concept equivalence mapping with the concepts used by their

trading partners

.

Publicly Visible Terminology in a Standard Model

Terminology

Terms Abbreviations Definitions Images

Public Domain Concept Identifier 0161-1#xx-xxxxxx#1

(9)

Data requirements support a business function, granting access

to a computer, a website or a software program, simply asking

for the data needed to deliver the right product or service or to

comply with a regulation, these are all data requirements.

Be careful what you ask for – data quality starts with the quality

of the request for data

Data Requirement (ISO 22745-30)

Data requirement

eOTD-i-xml ISO 22745-30

(10)

Clear and unambiguous requests for:

1.

Reference data (identifiers)

2.

Characteristic data (descriptions)

3.

The validation of reference and characteristic

data,

ISO 22745-35 is a standard format for the generation

and distribution of requests for data in a simple XML

format that can be automated by the sender and

recipient to create an integrated data exchange system.

Request for Authoritative Sourced Data (ASD)

Request for data

eOTD-q-xml ISO 22745-35

(11)

ISO 22745-40 is a standard format for the

exchange of

reference and characteristic data

in a simple XML

format that can be automated by the sender and

recipient to create an integrated data exchange system.

Reply to a Request for Data

Data requirement eOTD-i-xml ISO 22745-30

Request for data eOTD-q-xml ISO 22745-35

Data exchange

eOTD-r-xml ISO 22745-40

(12)

Data requester

Data provider

Sub

A data provider may not have all the data requested so they in

turn send a request through their data supply chain using the

same ISO 22745 standard exchanges

Request for data

eOTD-q-xml ISO 22745-35

Data exchange

eOTD-r-xml ISO 22745-40

Request for data eOTD-q-xml ISO 22745-35 Data exchange eOTD-r-xml ISO 22745-40 Data requirement eOTD-i-xml ISO 22745-30

(13)

Slide 13 ISO 22745 ISO 8000

Portable data

Quality data is portable data; it is

independent of the software application

and accessible by any application

Data de-dup app Data quality analysis app Data validation app

Portable data - the Antidote to Application “lock-in”

User

Application

Operating system

(14)

Motivation for ISO 22745 and ISO 8000

Controlling costs requires better

asset, product, component and

process visibility. This is achieved

through faster, better and lower cost

access to authoritative sourced data.

(15)

Slide 15

Motivation for ISO 22745 and ISO 8000

Controlling costs requires better

asset, product, component and

process visibility. This is achieved

through faster, better and lower cost

access to authoritative sourced data.

(16)

The data shall be ISO 8000-110:2009 compliant.

The data shall comply with registered ISO 22745-30 compliant data requirements

The data shall be encoded using concept identifiers from an ISO 22745 compliant open technical dictionary that supports free resolution to concept definitions.

The data shall be provided in ISO 22745-40 compliant Extensible Markup Language (xml).

The contractor, sub-contractor or supplier shall, as and when requested to do so,

supply technical data in electronic format on any of the items covered in this

contract as follows:

• Providing the data necessary for the safe and efficient operation of plant and

equipment is an established legal requirement in most countries.

• Creating ISO 8000-110:2009 compliant data does not require the payment of any

license fees or the use of specialized software, it is within the technical ability of all suppliers regardless of their size.

(17)

Slide 17

1. Document your business language

2. Document your requirements for data

3. Measure the quality of your data

(by comparing your data to your requirements for data)

There are low cost software tools

that make this process very straight forward

References

Related documents

Our approach leverages data mining techniques in order to explore medical databases of previously encoded secondary diagnoses and use the stored structured information (age,

In this paper, we proposed a computational application of the concept of urban vitality defined by Jane Jacobs, using mobility data sourced from the Oyster card as a proxy for

Using Business Rules techniques in data quality validation By using the ECA-rules Business Rules concept to explicitly model the necessary checks to measure the quality of a

We evaluate the quality of the public contracts data on those quality characteristics defined in the ISO 25012 standard that were suitable to the type of data under study and

Borrowing from these definitions, we propose a definition for big data software as “software that supports the timeconstrained processing of continuous

g) shall be subject to appropriate technical and organisational measures against unauthorised or unlawful processing of personal data and against accidental loss or destruction

This Indian Standard (First Revision) which is identical with ISO 7200 : 2004 ‘Technical product documentation — Data fields in title blocks and document headers’

Credential Registry – More than a database, the Registry collects and connects credential data described with CTDL and supports and an open applications marketplace by turning