• No results found

DATA GOVERNANCE AND DATA QUALITY

N/A
N/A
Protected

Academic year: 2021

Share "DATA GOVERNANCE AND DATA QUALITY"

Copied!
20
0
0

Loading.... (view fulltext now)

Full text

(1)

Kevin Lewis – Partner

Enterprise Data Management COE

Barb Swartz – Account Manager

Teradata Government Systems

DATA GOVERNANCE AND

DATA QUALITY

(2)

Objectives of the Presentation

Show that Data Governance and Data

Quality are part of a larger EDM

function

Provide a process framework for effective

Data Quality Management

Explain the role of Data Governance and

Stewardship in a Data Quality function

Provide advice on aligning a Data Governance program to

(3)

EDM Framework

A Path to Integrated and Trusted Information

Data

Governance

Data

Stewardship

Integrated

and

Trusted

Information

• 

Data Governance –

The practice of organizing and

implementing principles, policies, procedures and

standards for the effective use of data

• 

Data Stewardship -

Continual, day-to-day activities

of creating, using, and retiring data

• 

Data Quality –

Ensure data is fit for its intended use

• 

Data Integration –

Includes Data Acquisition (ETL/

ELT) processing to combine transaction and master data

to provide a consistent, meaningful, and trusted view of

the data across business units and subject areas

• 

Data Security and Privacy –

Information security,

data privacy and regulatory compliance across data

subject areas, including monitoring and audit capabilities

• 

Metadata Management –

The people, processes

and technical components necessary to ensure that

metadata is easily accessible, consistent, current,

accurate, timely and complete

• 

Master Data Management –

Management of

master data domains, such as Product and Customer data,

that provide context for transactional data

• 

Data Architecture –

The logical and physical data

modeling plus other activities needed to understand

People, Processes, and

Technology

Data

Integration

Data

Architecture

Quality

Data

Master

Data Mgmt

Metadata

Mgmt

Data

Security

and

Privacy

(4)

Data Governance, Data Stewardship, and

Enterprise Data Management

Data Governance provides

oversight for

Enterprise Data Management

(EDM)

Data Stewardship provides

the day-to-day business

involvement for EDM

activities

Data

Governance

Data

Stewardship

Integrated

and

Trusted

Information

Data

Integration

Data

Architecture

Quality

Data

Master

Data Mgmt

Metadata

Mgmt

Data

Security

and

Privacy

(5)

Data Quality

• 

Accuracy –

data represents reality correctly

• 

Completeness –

data gaps are minimized and data subjects are

covered adequately

• 

Timeliness –

data is stored in system within an acceptable

time from the business event

• 

Consistency –

data is defined and reported with the same

meaning and values across the enterprise

The core dimensions of

data quality are:

Data Governance determines the focus of data

quality improvements based on business value

Data Stewards provide business understanding

of assigned data subjects

Data

Governance

Data

Stewardship

Integrated

and

Trusted

Information

Data

Integration

Data

Architecture

Quality

Data

Master

Data Mgmt

Metadata

Mgmt

Data

Security

and

Privacy

(6)

Dimension Description

Conformance

Non-Conformance

Accuracy A measure of information

correctness A balance of $10,000 is stored as a balance $10,000. A balance of $10,000 is stored as a balance of $12,500. Consistency A measure of the degree of conflicts

that exist in situations with redundant data

A balance of $10,000 in the ABC system is also stored as $10,000 in the XYZ system.

A balance of $10,000 in the ABC system is also stored as $12,500 in the XYZ system.

Entirety A measure of the quantities of entities created, versus the real world or the number of actual events

All phone calls that were made were

recorded and stored for billing. Calls to a particular NPA-NNX were not recorded due to a switch profile problem. Revenue for these calls will be lost.

Breadth A measure of the amount of information captured about an object or event

All information about a specific call is captured including duration, start and stop time, origination and termination information, billing information, network information, etc.

None of the network related information for a specific call is captured. Nothing is known about how the call was handled by the network.

Completeness A measure of information caps

within a specific entity occurrence Name, age, and occupation are known for all customers. Name and age are known for all customers but occupation is known for only 50% of the customers.

Uniqueness A measure of unnecessary

information replication Customer information is stored once for each customer. Certain customers’ records are duplicated due to variations in the spelling of the name, alternate address, etc. The records are not linked in any way.

Interpretability A measure of semantic standards

being applied A date is stored as 11 June 2002 A date stored as 11062002 is interpreted as November 06, 2002. Timeliness A measure of how current a record

is All customer addresses represent the current place of dwelling. Many customers have changed their address without informing the company. Precision A measure of exactness The amount of tax due for this specific

transaction is $0.104. The amount of tax due for this specific transaction is stored as $0.10. Depth A measure of the amount of entity

of event history that is retained A complete history of orders, bills, and payments is retained for all customers. Orders, bills, and payment information is only retained for one year. Each month, the prior year records are deleted for that month to make room for the new information.

Integrity A measure of validity with respect to

another item of related information A call detail record contains a from number of (404) 240-9999. The Terminating Point Master table indicates that due to an area code split, the 240 NNX is now in the 770 NPA.

(7)

Business Objective:

Control

out-of-stocks and inventory carrying

costs

Action:

Provide order suggestion

to grocery stock clerk based on

forecasted sales and current

inventory balances

Data Problem:

Incorrect inventory balances in system

Root Cause (example):

Cashier not correctly identifying

produce item

Fix:

Label loose produce item with lookup code and GS1 Data Bar

Finding the problem (profiling):

Find unusual percentage

breakdown in sales data for certain produce categories

Monitoring (scorecarding):

Establish rule and threshold for

expected percentage breakdown versus actual

(8)

Data Quality Improvement Process Model

Step 3:

Analyze

Step 4:

Trace Root Causes

People

Process

Information

Technology

Step 6:

Monitor & Trend

Error

count

Time

Step 2:

Profile

Value

No. of

Errors

Step 1:

Select & Define

Step 5:

(9)

Data Governance and Stewardship Roles for

Data Quality

Step 3:

Analyze

Step 4:

Trace Root Causes

People

Process

Information

Technology

Step 6:

Monitor & Trend

Error

count

Time

Step 2:

Profile

Value

No. of

Errors

Step 1:

Select & Define

Step 5:

Fix Root Causes

Data Governance

Council determines

appropriate focus

Data Steward

brings business

meaning of data

… helps interpret

profiling results

… helps determine

business root

causes

… approves IT fixes

and facilitates

business change

… monitors data

quality and initiates

improvement

(10)

Technology Enablers for Data Quality

Step 3:

Analyze

Step 4:

Trace Root Causes

People

Process

Information

Technology

Step 6:

Monitor & Trend

Error

count

Time

Step 2:

Profile

Value

No. of

Errors

Step 1:

Select & Define

Step 5:

Fix Root Causes

Data Quality

Scorecarding /

Monitoring tools

Data Profiling Tools

Match / Merge tools

MDM

Data Enrichment

Input Controls

(11)

The Role of the Data Warehouse in Data

Quality Improvement

Source

Databases

Business

process

Business

process

Business

process

DW

DW uses

(CRM, Data

Mining, etc.)

DQ Scorecard

Awareness,T

raining,

Motivation

System &

Process

changes

Data

Cleanse

(12)

Data Management Organization

Executive Steering

Committee

B

u

si

n

ess

In

te

lli

ge

n

ce

C

om

pe

te

n

cy

C

en

te

r

Data Stewards

Data Governance

Council

IT

Business

Executive Steering Committee –

Provides ultimate authority needed to

unify information across the

organization

Data Governance Council –

Represents the entire organization to

facilitate efforts that unify information

Data Stewards – Works across

business areas and systems to ensure

integrity of assigned data subjects

Business Intelligence Competency

Center – Provides information and

analytical services to the enterprise

The structures shown here are

primarily business-focused. IT

supports these organizations by

ensuring that IT solutions are in

place that enable each area of EDM.

(13)

Data Domain

Primary Role

Sales

Customer

Asset

Finance

Location

Campaign

etc.

Data Owner

Data Steward

IT Data Steward

Business Area

BICC

Marketing

Purchasing

Operations

Sales

Accounting

Customer

Service

Europe

South America

etc.

Data Stewardship Matrix

Names go in

these boxes

(14)

Building Data Governance

Assess current

capabilities

(P, P, & T)

Capture data

issues

for DG

Identify

projects

to benefit from DG

Prioritize and

plan capability

improvements

Develop process to

resolve data issues

Develop process to

link to projects

Implement

capability

improvements

Building Capability

to Sustain and Increase Business Value

(15)

Data and Capabilities are Deployed

Incrementally to Support Business

Initiatives

Application 1

Application 2

Application 3

Project 1

Project 2

Project 3

Capability 1

Capability 2

Capability 3

Data Domain 1

Data Domain 2

Data Domain 3

Each data domain supports

one or more functional

projects while simultaneously

providing more data to BI

users

Projects that deploy

data (e.g., sales data,

inventory data)

Data Warehouse

Projects that use

data (e.g., Supply

Chain Management,

Personnel,

Maintenance)

BI Users

Access

Projects that

deploy capability

(e.g., DQ, MDM,

Data

Stewardship)

(16)

Implement

complete

solution,

including

DQ, MDM,

etc.

Build data

quality

monitoring

with

thresholds

Ensure

proposed

solution

architecture

meets

standards

Perform

detailed

data

profiling on

required

elements

Perform

high level

data

profiling on

proposed

sources

Prioritize,

resolve, and

communicate

data issues

Communicate

changes using

Data

Stewardship

Network

Support ongoing

data quality

program;

maintain

metadata

Integration with the

System Development Life Cycle (SDLC)

Plan

Analyze

Design

Build

Implement Manage

Data Quality and related activities should be embedded in projects;

these are just a few examples:

Design

data

quality

rules and

include in

SLA

Capture

business

metadata

and design

mechanism

to deliver

Roadmaps

and PPM

help us plan

for each

project

(17)

AF Global Combat Support Systems

Data Services

Overview

>

Supports information sharing across all domains, services, &

DoD agencies

>

Offers role-based “on demand” access to data

>

Provides designated authoritative data repository of current &

historical data

>

Provides data transformation & integration

>

Utilize Commercial Off-the-Shelf (COTS) based solution

>

Net-centric environment

The Environment

>

Over 19TBs of user data spread across more than 95 databases

>

Acquiring data from over 108 sources processing over 50

million rows of data daily

Mostly batch interfaces, but do support Change Data

Capture to meet near real time requirements

>

Data Analytics

Business Objects, Cognos, 19 “High Profile” Rich

Internet Applications supported by Web Services

>

Providing access to multiple USAF Communities and

(18)

Teradata  Corpora+on  

We invented Data Warehousing

>

Global Leader in Enterprise Data

Warehousing

>

Positioned in Gartner’s Leaders

Quadrant in data warehousing since

1999

We pioneered the Active Data

Warehouse Market

>

Extending traditional data

warehousing for operational

intelligence

Global presence and world-class

customer list

>

More than 1,000 customers

>

10 years at USAF

>

More than 2,500 installations

7,000 associates

Traded on NYSE (TDC)

Integrated Solution

Implementation Services

Architecture Consulting

Services

Business

Consulting Services

H

ar

d

w

ar

e

S

o

ft

w

ar

e

Prof

.

Serv

ices

Storage

Server

Analytic Applications

Support Services

Database Software

(inc. Tools and Utilities)

(19)
(20)

References

Related documents

As a small business owner, I feel Little Acorn Media gives me the broadest reach for my advertising dollars. It is amazing to see how much Little Acorn Media continues

See, Zeran, 129 F.3d, at 333 (noting that if notice could be used to trigger litigation, the ISPs would respond by removing the speech on the Internet, even if the speech is

According to the international experience, federal authorities can carry out six groups of functions for support of mechanisms of development of innovative

Step 1: Click “Middle Button” button in “Oscar Main Menu” >> click and select “Macro.. Manager” property

Th e questionnaire consists of twenty eight questions which are divided into six sections, which are students’ perception of dentistry as a professional career (three

[r]

The mothers of children with autism showed abnormal watching behavior, spending less time than control mothers watching the ROIs on three of the four movies (Puzzle Rabbit,

The Australian Census, for example, employs a number of unique data collection and enumeration techniques to estimate the prevalence of primary (sleeping rough or living in an