• No results found

Evolving Data Quality Management Capabilities in Complex Organizations: The Health Care Case

N/A
N/A
Protected

Academic year: 2021

Share "Evolving Data Quality Management Capabilities in Complex Organizations: The Health Care Case"

Copied!
51
0
0

Loading.... (view fulltext now)

Full text

(1)

Evolving

Evolving

Data Quality Management Capabilities

Data Quality Management Capabilities

in Complex Organizations:

in Complex Organizations:

The Health Care Case

The Health Care Case

Bruce N. Davidson, Ph.D., M.P.H.

Bruce N. Davidson, Ph.D., M.P.H.

Director, Resource & Outcomes Management

Director, Resource & Outcomes Management

Cedars

(2)

Bruce Davidson, PhD - copyright Cedars-Sinai Health System, 2007

The Context

The Context

Cedars

Cedars

-

-

Sinai Medical Center

Sinai Medical Center

Academic Medical Center/Health System

Academic Medical Center/Health System

Largest Non

Largest Non

-

-

Profit Hospital in the Western US

Profit Hospital in the Western US

950 Beds, 10,000 employees, 2000 MDs

950 Beds, 10,000 employees, 2000 MDs

Basic Annual Statistics

Basic Annual Statistics

55,000 inpatients

55,000 inpatients

280,000 outpatients

280,000 outpatients

55,000 ER visits

55,000 ER visits

7,000 deliveries

7,000 deliveries

PG 6

(3)

Complex, information

Complex, information

-

-

intensive organization

intensive organization

Distributed oversight responsibilities

Distributed oversight responsibilities

Transactional data systems populated as

Transactional data systems populated as

byproduct of patient care

byproduct of patient care

Information managed as departmental resource

Information managed as departmental resource

rather than as enterprise resource

(4)

Bruce Davidson, PhD - copyright Cedars-Sinai Health System, 2007

MISSION

MISSION

ƒ

ƒ

Patient care

Patient care

ƒ

ƒ

Teaching

Teaching

ƒ

ƒ

Research

Research

ƒ

ƒ

Community Service

Community Service

RESOURCES

RESOURCES

ƒ

ƒ

People

People

ƒ

ƒ

Money

Money

ƒ

ƒ

Equipment

Equipment

PG 8

(5)

MISSION

MISSION

ƒ

ƒ

Patient care

Patient care

ƒ

ƒ

Teaching

Teaching

ƒ

ƒ

Research

Research

ƒ

ƒ

Community Service

Community Service

RESOURCES

RESOURCES

ƒ

ƒ

People

People

ƒ

ƒ

Money

Money

ƒ

ƒ

Equipment

Equipment

ƒ

ƒ

Information

Information

(6)

Bruce Davidson, PhD - copyright Cedars-Sinai Health System, 2007

1997

1997

DPG convened to address

DPG convened to address

data crises

data crises

1998

1998

TDQM Summer Course

TDQM Summer Course

DQMWG Spun Off of DPG

DQMWG Spun Off of DPG

IQ Survey, round 1

IQ Survey, round 1

1999

1999

DQ Concept Kick

DQ Concept Kick

-

-

Off

Off

IQ Survey, round 2

IQ Survey, round 2

DQM Objectives first appear in

DQM Objectives first appear in

Annual Plan

Annual Plan

The

The

long and winding road

long and winding road

(1)

(1)

2000

2000

Big DQ Improvement Project

Big DQ Improvement Project

DQM Objectives appear again

DQM Objectives appear again

in Annual Plan

in Annual Plan

ROM Dept reorganization to

ROM Dept reorganization to

capitalize on DQ framework

capitalize on DQ framework

2001

2001

DPG & DQMWG Charters

DPG & DQMWG Charters

renewed

renewed

DQM Objectives again in

DQM Objectives again in

Annual Plan

Annual Plan

IQ Survey, round 3

IQ Survey, round 3

(7)

2002

2002

DPG thinks about pro

DPG thinks about pro

-

-

active

active

DQM infrastructure

DQM infrastructure

ROM designated as data

ROM designated as data

clearinghouse

clearinghouse

for approval

for approval

of all clinical statistics

of all clinical statistics

reported out

reported out

JCAHO accreditation

JCAHO accreditation

standards for MOI linked to

standards for MOI linked to

DQM initiative

DQM initiative

2003

2003

Data crisis: No P&Ls for 8

Data crisis: No P&Ls for 8

months

months

Initiate Data Warehouse

Initiate Data Warehouse

Improvement Project

Improvement Project

2004

2004

Propose DQMU (x2)

Propose DQMU (x2)

Implement Data Warehouse

Implement Data Warehouse

Improvement Project

Improvement Project

DQMU funded as part of ROM

DQMU funded as part of ROM

IQ Survey, round 4

IQ Survey, round 4

Data crisis: No Management

Data crisis: No Management

Reports for 5 months following

Reports for 5 months following

new PM system implementation

new PM system implementation

2005

2005

DWIP Objectives appear in Annual

DWIP Objectives appear in Annual

Plan

Plan

DQMU staffed and DQM Program

DQMU staffed and DQM Program

initiated

initiated

DWIP Project Plan institutionalized

DWIP Project Plan institutionalized

The

(8)

Bruce Davidson, PhD - copyright Cedars-Sinai Health System, 2007

2006

2006

DQ Objectives appear in

DQ Objectives appear in

Annual Plan

Annual Plan

DQ Objectives linked to

DQ Objectives linked to

executive management

executive management

incentive compensation

incentive compensation

Development of Data

Development of Data

Certification Program for

Certification Program for

High Priority

High Priority

data elements

data elements

Pilot estimation of DQ ROI

Pilot estimation of DQ ROI

Development of explicit

Development of explicit

criteria for resolving

criteria for resolving

High

High

Priority

Priority

Data Quality

Data Quality

Incidents

Incidents

The

The

long and winding road

long and winding road

(3)

(3)

2007

2007

IQ Survey, round 5

IQ Survey, round 5

117 Data Quality Incidents

117 Data Quality Incidents

logged since initiation in

logged since initiation in

February 2005, of which 82

February 2005, of which 82

have been resolved and

have been resolved and

closed

closed

Collaboration with Internal

Collaboration with Internal

Audit department relative to

Audit department relative to

minimizing risk due to

minimizing risk due to

defective data

defective data

Continuing challenge to frame

Continuing challenge to frame

strategic issues in context of

strategic issues in context of

managing information as an

managing information as an

enterprise resource

enterprise resource

(9)

What We Hope For

What We Hope For

Quick results

Quick results

Unflagging support

Unflagging support

(10)

Bruce Davidson, PhD - copyright Cedars-Sinai Health System, 2007

What We Get

What We Get

Often painfully slow progress

Often painfully slow progress

with regular periods of

with regular periods of

stagnation, if not reversal

stagnation, if not reversal

Occasional bursts of support with

Occasional bursts of support with

frequent periods of inattention, or

frequent periods of inattention, or

forgetfulness

forgetfulness

Gradually enhanced, but

Gradually enhanced, but

intermittent, cooperation

intermittent, cooperation

(11)

What Does It Mean?

What Does It Mean?

Evolutionary process

Evolutionary process

Limitations of hierarchical control

Limitations of hierarchical control

Development of

Development of

shared mental

shared mental

models

(12)

Galaxy Data Quality Program

MIT IQ Industry Symposium

July 18-19, 2007

Ingenix

United Health Analytics

Galaxy – Shared Data Warehouse

Laura Sebastian-Coleman

IS Manager – Data Quality & End User Support

(13)

Overview

ƒ

Ingenix and Galaxy

ƒ

Galaxy’s DQ program

ƒ

Evolving business needs and the pace of change

(14)

©2006 Ingenix, Inc

Ingenix Background

ƒ

A global healthcare information company

ƒ

Founded in 1996 to develop, acquire, and integrate some of the

nation’s best-in-class healthcare information capabilities

ƒ

Significant and rapidly evolving portfolio of tools and services

now transform data into actionable, fact-based,

technology-enabled decision support

ƒ

Ranked among the top 10 providers of informatics by

Healthcare

Informatics

magazine in June 2006

ƒ

Today there is an Ingenix product at work in nearly every U.S.

healthcare organization.

ƒ

Ingenix is a wholly owned subsidiary of UnitedHealth Group

(UHG).

(15)

Galaxy Overview

ƒ

Atomic Data Warehouse with transformations

ƒ

Integrates data from more than a dozen subject areas (claim,

membership, customer, provider, etc.) across multiple sources

ƒ

Size

ƒ

350 source input files from more than 25 distinct internal and external

sources (and counting)

ƒ

18 TB of data; 62 TB footprint

ƒ

3,159 attributes across 12,632 columns in 600 tables (and counting)

ƒ

Largest table: more than 1.5 billion rows

ƒ

1,704,717,031 on Claim Statistical Service as of 5/3/07

ƒ

Usage

ƒ

Over 1,000 registered users

ƒ

7,888 queries per day / 256,656 per month, on average

ƒ

Ad hoc, scheduled queries, production extracts to applications and marts

ƒ

Direct access to Galaxy via user-selected tools – Sagent is administratively

(16)

©2006 Ingenix, Inc

Galaxy Physical Architecture

Mana gemen t Se rv ic es Domai n To ol s Re so ur c e (1 76 / 49 ) S tor ag e/ B a ck up -IP ( 4 0 / 93 ) S tor ag e/ B ac k up -IP (4 0) Is ol at e d V LA N (1 92 .1 60 .0 .0/ 16 ) Is o lat ed V L A N ( 17 2. 16 .0. 0/ 1 6) PG 20

(17)

System Components

ƒ

Hardware

ƒ

7 IBM P-series Servers P575

ƒ

2 IBM P-series Servers P510

ƒ

1 IBM P-series Server P570

ƒ

4 EMC DMX 3000 Storage Cabinets

ƒ

Additional supporting servers for Sagent, Autosys, etc.

ƒ

Software

ƒ

UDB with DPF v8.2

ƒ

AIX 5.3.0

ƒ

DataStage/PX 7.0.1

ƒ

Optiload 3.1

ƒ

CoSort 7.5.3

ƒ

Autosys 4.5

ƒ

Sagent 4.5i

(18)

©2006 Ingenix, Inc

(19)

Functions of Galaxy Data

Galaxy is the single source of truth for key business functions

ƒ

Medical Trend Analytics

ƒ

Pricing

ƒ

Provider Utilization & Profiling

ƒ

Appropriateness of Care

ƒ

Network Adequacy

ƒ

Care Management / Pattern of Care / Preventive Care

ƒ

Fraud & Abuse

ƒ

Customer Reporting

ƒ

HEDIS Reporting

ƒ

Member Demographics

(20)

©2006 Ingenix, Inc

Galaxy’s Data Quality Program

ƒ

Management recognized need for DQ when Galaxy was launched

ƒ

Theoretical / methodological foundations

ƒ

Correct data problems at the source

ƒ

Data as a product

ƒ

Statistical process control

ƒ

Primary functions of DQ program

ƒ

Monitor, measure, and report on Galaxy’s Data Quality

ƒ

Recommend and implement actions based on findings

ƒ

Biggest initial challenge = establishing useful metrics

ƒ

What to measure / how to measure

ƒ

How to respond to the results of measurements

ƒ

2003 Initiated metrics & reporting program

ƒ

2004 Implemented first automated measures

ƒ

2004-2007: Deliver weekly/cyclic, monthly, quarterly, semi- annual

reporting through largely automated processes

(21)

Example of Weekly Measure

TGT_TBL_ETL_DT

In

di

v

idu

a

l V

a

lu

e

5/

24

/2

00

7

4/

2/

20

07

2/

10

/2

00

7

12

/2

7/

20

06

11

/2

/2

00

6

9/

17

/2

00

6

7/

26

/2

00

6

6/

2/

20

06

4/

14

/2

00

6

2/

23

/2

00

6

2.5

2.0

1.5

1.0

0.5

0.0

_

X=1.532

UCL=1.869

LCL=1.195

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

(22)

©2006 Ingenix, Inc

Quarterly Management Report

ƒ

Baseline Data Quality –

annual review and

baseline assessment

ƒ

Quality Indicators – key

attribute monitoring

ƒ

Galaxy Data Issues

Tracking

III . Galaxy Data Issues Tracking

Report Date:9/16/05

Galaxy Operations Contacts: Nancy Grimaldi, Eric Infeld, Laura Sebastian-Coleman

c.2005 UnitedHealth Group - Proprietary & Confidential

II . Quality Indicators -- Key Attribute Monitoring

Galaxy Data Quality

As of 9/16/2005

I . Baseline Data Quality -- Annual Review and Baseline Assessment Update

GALAXY

The DQ team conducted its Annual Baseline Assessment in July and August. This year’s Baseline reviewed 1483 primary and foreign keys on Galaxy’s fact tables for business expected values. The Baseline focuses on identiying and researching business illogical values and provides a benchmark for the validity and accuracy of key data. The results showed 99.73% of attributes exceeded 4 sigma, as compared with 99.44% in 2004 and 96.82% in 2003. This improvement was due to issues identified and resolved through previous baseline assessments and ongoing maintenance and quality improvement efforts. For the past two years, we have measured values based on a four sigma standard. This year we also collecteded data for five and six sigma.

• 99.46% of attributes exceeded 5 sigma • 99.12% of attributes exceeded 6 sigma 4 attributes in two subject areas did not meet 4 sigma

• One involves invalid values being populated due to issues with a source feed (see P&I request 1613). • One involves a field that was hard-coded with a default value (see P&I request 1617) • Two involve blanks in fields where blanks are not expected; these need to be researched to determine whether blanks are being supplied by the source or caused by a Galaxy process (see P&I Requests 1618 and 1619)

• Additional automated reports on this model are now being implemented on newly integrated MAMSI data. • By indicating whether Galaxy matching processes are operating as expected, these reports allow for early identification of potential data and processing issues.

• Galaxy data issues are identified by SDW business analysts or by end users, tracked through an internally maintained database, and addressed as part of ongoing operations. "Issues" reported do not include funded projects or source system changes that require a work effort on the part of Galaxy Operations.

• The total number of issues quarter-to-quarter has remained relatively steady. Currently 83 issues are identified as medium to high priority, compared to 82 last quarter. • Of the 83 identified issues, 3 are actively being resolved. This is 3.6% of identified issues, compared to 8.5% last quarter. See graph on the upper left. • The graph on the lower left shows the number of requests closed per quarter. Requests include issues, external initiatives, internal initiatives, and source system changes. 18 issues were closed this quarter compared to 12 in Q2 2005. 17 external requests were closed this quarter, as opposed to 3 in Q2 2005.

• The graph on the right shows the distribution of issues across subject areas for the second and third quarters of 2005. In all subject areas, SDW will continue to leverage project teams to address issues as opportunities are identified to do so.

• The graph to the right above tracks results from the Pharmacy Claim control and compares the level of default Member System IDs on UNet and Prime claims.

• This quarter (Q3 2005) there have not been any spikes in the Pharmacy Claim matching process. • The two dramatic spikes were caused by an increase of several hundred records above expected levels. The overall level of defaults remains at less than a tenth of a percentage. Before process changes in July 2004, the level for both source systems was around 0.10%. • Control reporting tracks key quality indicators as part of

the cyclic data prep and load process. • The Unet Claim graph (above) illustrates how well Galaxy identifies CES membership and matches members to UNet claims. This process has been operating within expected limits at this level for over a year and is now showing a strong downward trend.

ea CustomerFA OGeographyLabMembOrganizationer PharmacyProductProv ider C laim Statistical Claim F inancial 35 30 25 20 15 10 5 0 Variable In construction Outstanding

Number of Issues Outstanding By Subject Area 6-15-2005

Graph represents a point-in-time snapshot of data on 6/16/05 1

5 11 82 Issues identified 7 Solutions under construction 75 Issues outstanding 5 1 8 2 31 1 3 21 2 Load Date P e rcen t o f 0 M b r S ys I D cl a im s 8/27/2005 7/18/2005 6/12/2005 5/3/2005 3/25/2005 2/14/2005 1/2/2005 11/29/2004 10/9/2004 9/2/2004 7/28/2004 0.080 0.075 0.070 0.065 0.060 0.055 0.050 Accuracy Measures MAPE6.26364 MAD0.00411 MSD0.00003 Variable Actual Fits

Trend Analysis Plot for Percent of 0 Mbr Sys ID claims

Linear Trend Model Yt = 0.0714214 - 0.000162139*t Load Date Pe rc e n ta g e 7/28 /2005 5/3/2005 2/14 /2005 12/3 /2004 9/12 /2004 6/26 /2004 4/3/2004 1/11 /2004 10/2 4/2003 8/2/2003 5/21 /2003 0.11 0.10 0.09 0.08 0.07 0.06 0.05 0.04 0.03 0.02 Variable Prime Percentage Unet Percentage

Percentage of 0 Member System IDs on Pharmacy Claim by Source System Data as of 09-03-05 N u m b er o f R eq u es ts C lo sed 2005 - Q3 2005 - Q2 2005 - Q1 2004 - Q4 2004 - Q3 2004 - Q2 2004 - Q1 60 50 40 30 20 10 0 Quarter Issues External initiatives Internal Initiatives Source system changes Requests Closed Per Quarter 2004 - Q1 through 2005 Q3

a C ustomerFAOGeography LabMemberOrganizationPharmacyProductProv ider C laim Statistical Claim Financial 35 30 25 20 15 10 5 0 Variable In construction Outstanding Number of Issues Outstanding by Subject Area 9-15-05

83 Issues Identified 3 Solutions under construction 79 Issues outstanding

Graph represents point-in-time data on 9/16/05 1 28 2 4 1 1 2 1 30 6 7 Date Nu m b e r o f I ss u e s 9/16/2005 6/16/2005 3/11/2005 12/09/2004 9/10/2004 6/16/2004 3/01/2004 90 80 70 60 50 40 30 20 10 0 Variable Issues Outstanding Issues Identified Solutions in Construction Addressing Identified Issues in Galaxy

Graph represents point-in-time data for dates taken once per quarter. Q1 2004 - Q3 2005 Nu m b er o f at tr ib u tes an al y zed Year 2003 2004 2005 1600 1400 1200 1000 800 600 400 200 0 Variable Attributes not meeting 4 sigma Attributes meeting 4 sigma

Baseline Assessment Results 2003-2005

36 1101 8 1239 4 1480 96.82% met 4 sigma 99.44% met 4 sigma 99.73% met 4 sigma PG 26

(23)

Current Situation

ƒ

Galaxy = a mature, enterprise data warehouse

ƒ

High demand for data and for organizational services

ƒ

Galaxy’s DQ program also relatively mature

ƒ

Defined metrics

ƒ

Automated data collection

ƒ

Regular reporting

ƒ

DQ Community

ƒ

UHG growing, largely through acquisitions and

partnerships

ƒ

Healthcare industry changing – relation of government

to health care, new products, esp. consumer driven

(24)

©2006 Ingenix, Inc

Pace of Change for Galaxy

ƒ

2004

ƒ

Galaxy integrated data from MAMSI, a United Health Group acquisition

ƒ

Used the existing structure

ƒ

1+ year to integrate

ƒ

2006

ƒ

Integrated data from three new source systems

ƒ

Developed a new subject area, Revenue

ƒ

Significantly expanded Customer subject area

ƒ

Responded to healthcare industry changes

ƒ

Part D data

ƒ

HRA (Health Reimbursement Account) data

ƒ

2007

ƒ

Integrate data from additional acquisitions

ƒ

Expand the Revenue subject area

ƒ

Continue to support the use and enhancement of existing data.

ƒ

2008

ƒ

Two major integrations already scheduled

ƒ

Potential for several others

(25)

Pace of Change for Galaxy DQ

ƒ

Biggest challenge

ƒ

2003 what to measure and how to measure

ƒ

2007 how to rapidly analyze and act on DQ data

ƒ

Baseline Assessment of Galaxy Data Quality

ƒ

2003

ƒ

800 person hours to pull and analyze data for first Baseline Assessment

ƒ

Duration = more than 3 months

ƒ

Measured 1137 attributes

ƒ

2006

ƒ

Pulled 75% of data in less than 10 hours through an automated process

ƒ

Measured 1506 attributes

ƒ

Pull data quarterly

ƒ

Automated reports

ƒ

2004: 4 reports

ƒ

2007: 80 reports

(26)

©2006 Ingenix, Inc

2007 – 2008 Key UHG Business Needs

ƒ

UHG acquisitions and partnerships –

ƒ

More data for Galaxy

ƒ

More users need access

ƒ

Users need data sooner –

ƒ

Time to integrate data into Galaxy must be shortened

ƒ

Legacy data critical for ensuring reporting continuity

and analytics –

ƒ

Continued support is necessary

ƒ

Data consistency across sources critical for reporting

continuity and analytics –

ƒ

Integration methodologies need to promote and enforce

consistency

(27)

How to Respond?

ƒ

Data Quality included in set of changes to improve

efficiency and agility

ƒ

Common Interface – puts more responsibility on source

systems for data quality

ƒ

Gateway – changes how Galaxy prepares data.

ƒ

DQ measures

ƒ

More comprehensive

ƒ

Taken earlier in the process

(28)

©2006 Ingenix, Inc

Common Interface Approach

ƒ

Galaxy defines standard requirements and

layouts for data

ƒ

Sources map to these requirements and feed

to Galaxy

ƒ

Streamlined transformation/load into Galaxy

ƒ

Common model across the enterprise

(29)

Common Interface Architecture – Views

Common Interface

Table - Source 1

Common Interface

Table - Source

2

Common Interface

Table - Source

X

Existing

UNet/COSMOS/MAMSI

Table(s)

Physical Tables

(Objects)

COMMON

DATA ITEMS

Source 1 +

Source 2 +

Source N +

Existing

UNet/COSMOS/MAMSI

(common data items only;

updated field formats)

(30)

©2006 Ingenix, Inc

Gateway Integration Tool

ƒ

Facilitates mapping disparate data sources into a

Master Data Definition

ƒ

Applies generic transformation logic to the output

ƒ

Utilizes reusable transforms

ƒ

Performs automatic code generation

ƒ

Ensures consistency across source-to-target

mappings

ƒ

Provides true-to-code documentation

ƒ

Incorporates data quality modules

ƒ

Increases speed and reduces complexity of data

integrations

(31)

Gateway Integration Tool

Gateway

Mapping

and Validation

Database

Transformation

New Data

Sources

Gateway

Master Data

File Format

Standard reusable

transformations

(32)

©2006 Ingenix, Inc

Gateway – Data Quality Features

ƒ

DQ functions

ƒ

Monitor and react to events in processing

ƒ

Collect trend data

ƒ

Field validation

ƒ

Data type checking

ƒ

Value range checking

ƒ

Valid value list checking

ƒ

Assignment of default values

ƒ

Informational, error and warning messages

ƒ

File validation

ƒ

Format checking

ƒ

Field counts / record length validation

ƒ

Summary of field error and warning messages

ƒ

Thresholds of summary counts of errors and warnings that allow job

to be aborted if counts or percentages exceeded – generate alerts

(33)
(34)

©2006 Ingenix, Inc

Back to Basic DQ

ƒ

Data in the warehouse is only as good as data in the

source

ƒ

Ensuring sources to supply better data through the Common

Interface

ƒ

Manufacturing model: Data as a product produced

through a process

ƒ

Executing processes more consistently across the database

through the Gateway

ƒ

Measure to improve

ƒ

Gateway integrates and executes DQ measures consistently

across the database.

ƒ

Both tools measure ETL processes (timing of jobs, etc.) that

affect other aspects of data quality from end-to-end

(35)

DQ: Chicken or Egg?

ƒ

After 4 years – back to the beginning

ƒ

Applying theory/methodology more fully

ƒ

Applying at the beginning of integrations

ƒ

Applying more comprehensively across the warehouse

ƒ

Major re-thinking of all Galaxy processes

ƒ

Interacting with customers

ƒ

Writing specifications

ƒ

Obtain source files

ƒ

Mapping source-to-target

ƒ

Implementing ETL

ƒ

Building physical tables

ƒ

Taking DQ measures

ƒ

DQ still requires championing

ƒ

New problem: How to analyze and respond to findings from the

(36)

April 4, 2007

DSOWeb

Innovation in ETL

Transforming Raw Data into the

Building Blocks of Intelligence

(37)

About Ingenix - Data Services Organization (DSO)

The Ingenix Data Services Organization (DSO) annually

processes and integrates millions of service lines and

other claim-related data from more than 500 diverse

health and productivity related data sources for more

than 125 large employers and hundreds of small

employers. This translates to:

ƒ

4250 feeds processed in 2005

ƒ

8964 feeds processed in 2006

(38)

© Ingenix, Inc. 3

About Ingenix - DSO Processes

ƒ

Data is received across several media types related to eligibility,

medical claim (includes vision and mental health), pharmacy claim,

workers compensation, short-term disability, long-term disability,

FMLA, lab results, disease management, health risk appraisal,

payroll information, etc.

ƒ

Implementation is the design phase focusing on data layout, client

account structure, initial data quality plan, conversion rules and

successful completion of the first cycle.

ƒ

Update cycle is the periodic receipt of data from carriers/clients,

focusing on data review, investigation/resolution of inconsistencies

to ensure clean data, and loading onto analytical environment

(39)

Business and Operational Challenges

ƒ

Long turnaround time for data delivery

ƒ

Need for a cost effective data management solution

ƒ

Absence of streamlined data quality assessment and

investigation process

ƒ

Absence of company-wide standardized data intake

process

ƒ

Quality review process that is manual and subjective

leading to errors, rework and decreased confidence in

results

(40)

© Ingenix, Inc. 5

Business Needs

ƒ

Increase customer satisfaction through faster

turnaround time and higher data quality

ƒ

Higher productivity through faster learning curve

ƒ

Elimination of manual intervention and continuous

inspection of data

ƒ

Continually increasing automation

ƒ

Meet growing demand for faster data delivery and

higher level of data quality

ƒ

Efficient data profiling and data management

ƒ

Flawless data delivery to analytical environments

ƒ

Ability to leverage transformed data across multiple

Ingenix products

(41)

DSOWeb – Innovative ETL & DQ Solution

ƒ

Standardized data processing by data types – eligibility, medical,

drug, disability, WC, FMLA, HRA, lab and disease management

ƒ

Standardized incoming data into common format by data type

ƒ

Automated data quality checks

ƒ

Automated data trending

ƒ

Automated file processing and job monitoring

ƒ

User ‘friendly’ interface

ƒ

Data quality and trending failure analyses

ƒ

Transformation and mapping functionalities

ƒ

Integrated data quality investigation functionalities

ƒ

Validated client and carrier details from profiles such as:

Employee status, employee type and types of coverage

Number of covered lives and products

(42)

© Ingenix, Inc. 7

Functionality

ƒ

Enables users to create source to

common data mapping

ƒ

Point and click functionality;

functions, operators and attribute

selection easily added to

transformation box

ƒ

Dynamic Help; displays syntax

and example for each function

ƒ

On-screen listing of source

attributes, mapped attributes and

look-ups for transformations

ƒ

Text box to capture business logic

for each transformation

ƒ

All transformations are stored in

meta data tables and available for

hard copy documentation and

searches

(43)

DSOWeb – Data Quality

ƒ

Data Quality Rule engine systematically compares data

results to rule thresholds and only flags exceptions for

DSOWeb user intervention

ƒ

Metadata Driven Rules Engine - Quick turnaround for

adding, deleting or modifying rules

ƒ

More than 7,000 data quality rules tailored to each data

type

ƒ

Different rule types for column property enforcement,

structure enforcement and business rules enforcement

ƒ

Formulated from verified client claims, eligibility, lab

results, HRA and workforce productivity data experience

(44)

© Ingenix, Inc. 9

DSOWeb – Trending

ƒ

Flags unexpected trends in data. Two levels are:

ƒ

Month-to-date trending with the previous month’s data

ƒ

Year-to-date trending with the previous year’s data.

For year-to-date, the trending is done up to the month the data is being

processed

Example

: If the plan begins in January and the current processing month is

June, then trending is performed between received year’s January-June and

data for January-June of the previous year.

ƒ

Metadata driven trending rules engine; quick turn around for adding,

deleting or modifying rules

ƒ

Configurable tolerance ranges for trending rules

ƒ

Examples of tolerable ranges

Record Count

Passing Range

Less Than 10000

5% to +5%

10001 -25000

-3% to +3%

Greater Than 25000

-1% to +1%

(45)

DSOWeb – Job and File Processing Dashboard

ABC Client ABC Client ABC Client ABC Client ABC Client ABC Client Carrier A Carrier B Carrier C abc_a_a_F20051001 _T20051231_VEL60 22.txt abc_b_b_F2006040 1_T20060630_vEL2 267.txt abc_c_c_F20051201 _T20051231_V6D00 58.txt

(46)

© Ingenix, Inc. 11

Decision Making

ABC Client Abc_car1_car2_F20051001_T20051231_VEL6447.txt Car1 PG 50

(47)

DSOWeb – Integrated DQ Investigation Tool

ƒ

Data investigations and

validation environment

integrated within application

ƒ

Uses BI tool allowing users to

query converted data without

needing to understand the data

structure or a query language

ƒ

Ad-hoc and standard reports by

data type

ƒ

Automatically schedules and

creates standard reports with

each production update

(48)

© Ingenix, Inc. 13

DSOWeb – Raw Data Analysis Tool

ƒ

Raw data investigation tool

integrated within application

ƒ

Uses home grown tool that

generates SAS queries

without the need to

understand SAS

ƒ

Ability to save queries for

re-use

ƒ

Automatically emails

formatted results to end user

for printing or sharing

(49)

DSOWeb – Benefits

ƒ

Cost-effective data management solution

ƒ

Facilitates timely data investigation

ƒ

Faster turnaround time for data delivery

ƒ

Automated process eliminates human errors

ƒ

Streamlined process ensures scalability

ƒ

Metadata based DQ engine; ease of rules maintenance

ƒ

Human intervention only targeted at exception cases; increases

productivity and maximizes quality output

ƒ

Timely issue resolution

ƒ

Metrics on data quality and operational processes can be reported

as needed with current data

(50)

© Ingenix, Inc. 15

Ingenix - DSOWeb

ƒ

Thank You

(51)

Cambridge, Massachusetts, USA

The MIT 2007 Information Quality Industry Symposium

Proceedings of the MIT

2007 Information

Quality Industry

Symposium

9 - 10:30 AM

Session 1B: Federal Data Architecture

Moderator: Skip Slone, Lockheed Martin

1.

Adel Harris, Citizant, Inc

2.

Mark Amspoker, Citizant, Inc

3.

Burton Cutting, Royal Bank of Canada

References

Related documents

According to the international experience, federal authorities can carry out six groups of functions for support of mechanisms of development of innovative

5 CDDC functions transferred into DCC, clearly branded as a business focused team, but arms length company retained to maximise any future trading opportunities 

This study focused on teachers in the southern feeder area of a large school district in eastern North Carolina and evaluated whether there are significant differences between

This document may contain forward looking statements that, while used in good faith, reflect Macquarie Telecom Group Limited’s current intention, plans, expectations, assumptions

[r]

Furthermore, the research suggests that understandings of rough sleepers’ lives presented in policy and service provider discourses which see these lives as ‘chaotic’, ‘off

Though the convergence in Table 6 is less stable than for Bermudan options, the results in this section indicate that the CONV method is able to price American options under a

divisions such as HROD, Information Technology and Legislative and Legal Services. 10) The Municipality assumes all costs associated with staff-related consumables. 11) The level