• No results found

Understanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities

N/A
N/A
Protected

Academic year: 2021

Share "Understanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities"

Copied!
26
0
0

Loading.... (view fulltext now)

Full text

(1)

Understanding Data Warehouse

Needs Session #1568

Trends, Issues and Capabilities

Dr. Frank Capobianco

Advanced Analytics Consultant

Teradata Corporation

Tracy Spadola CPCU, CIDM, FIDM

Practice Lead - Insurance Industry Consulting

Teradata Corporation

(2)

ANTITRUST Notice

The Casualty Actuarial Society is committed to adhering strictly to the letter and spirit of the antitrust laws. Seminars conducted under the auspices of the CAS are designed solely to provide a forum for the expression of various points of view on topics described in the programs or agendas for such meetings. Under no circumstances shall CAS seminars be used as a means for competing companies or firms to reach any

understanding – expressed or implied – that restricts competition or in any way impairs the ability of members to exercise

independent business judgment regarding matters affecting competition.

It is the responsibility of all seminar participants to be aware of antitrust regulations, to prevent any written or verbal discussions that appear to violate these laws, and to adhere in every respect to the CAS antitrust compliance policy.

(3)
(4)

4 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

Some Insurance Industry Drivers

Competitive Landscape

> Protection & Growth of Market Share

> Need for Better Pricing and Segmentation Strategies

> Merger & acquisition vs. Organic growth

Claims Costs

> Catastrophe losses

> Rising healthcare costs

> Need to contain overall Claims Expenses

Current Financial Crisis

> Market and Credit Crisis

> Potential Loss of Consumer Confidence

(5)

5 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

New Insights

Using the data warehouse to “Close the Books”

> Many organizations have made the warehouse a key component of

month-end close, giving them the ability to close the books faster and earlier in the month-end cycle

– Some can close the books at any time, and have results in minutes

– Statistical / Regulatory Reporting from the Data Warehouse

Understanding the “Quote to Bind” process

> Many organizations are stepping up analysis to capture ALL quote

iterations for better understanding of acquisition process and costs

Moving to a Customer-Focused Organization

> Insurance companies has typically been organized by product.

> Insight into customer portfolios, cross sell / upsell, better risk

(6)

6 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

The Data Drivers

Enhanced Customer Intelligence

> Single view of the Customer

> Move from Product to Customer View

Increased / Improved Risk Analysis

– Improved Underwriting / Pricing

– Catastrophe Modeling

– Global warming issues

– Enterprise Risk Management

Increased reliance on Predictive Modeling Capabilities

> Segmentation

> Loss characteristics

> Fraud & Abuse

Protection / Growth of Market Share

(7)

7 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

New Insights with New Data Types

Telematics

>

Capturing real time telematics for pricing and risk assessment for

“Pay as You Drive” programs

Unstructured Text

> Analysis of Claims / Adjuster notes for better Loss and Fraud

analysis

> Analysis of Call Center records for better customer service /

marketing “Voice of the Customer” analysis

Geographic Information Systems (GIS)

> Leading US Insurers using geo-spatial data to support risk

exposure analysis (concentration / location), reinsurance and regulatory compliance

Dynamic Financial Analysis

> Risk (financial, market, credit) modeling for improved financial

(8)

Data Warehousing Challenges and

Capabilities

(9)

9 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

• Campaign development and customer segmentation

• Customer behavior assessment using predictive modeling

• Event-based marketing • Channel analysis • Personalization • Identification of upselling and cross-selling opportunities • Assessment of channel, region and agent

performance

• Agent loyalty assessment

• Using internal and third-party data to price products to risk • Underwriting: pricing analysis,

frequency/severity analysis and exposure analysis

• Risk profiling

• Customer information analysis to determine service preferences

• Personalized and knowledgeable interaction via all channels as information is delivered to agents, brokers and other partners

• Claims performance and claims workflow assessment • Fraud identification and detection

Why do Insurance Companies Use / Need

Data Warehouses?

Servicing Servicing Servicing Policy Issuance Policy Policy Issuance Issuance Pricing/ Underwriting Pricing/ Pricing/ Underwriting Underwriting Sales Sales Sales Marketing & Pre-Sales Marketing Marketing & & Pre

Pre--SalesSales

• Financial analysis • Regulatory reporting • Reinsurance analysis • Operational performance analysis Operations Operations Operations Gartner 2008 – BI Conference

(10)

10 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

“Five Stages of an Active Data Warehouse Evolution”, Stephen Brobst and Joe Rarey, Teradata Magazine Online

OPERATIONALIZING WHAT IS happening now? ACTIVATING MAKE it happen! Link to Operational Systems Automated Linkages REPORTING WHAT happened? PREDICTING WHAT WILL happen? ANALYZING WHY did it happen? Batch Reports Ad Hoc, BI Tools Predictive Models Batch Ad Hoc Analytics Continuous Update/ Short Queries Event-Based Triggering

Life-Cycles of Data Warehousing

Workload

Complexity

(11)

11 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

Warehouse Reference Architecture –

The Target Topology

Warehouse Data Transform Layer Transactional Data Data Transformation Business and Operational Users Detail Data Layer Semantic or Usage View Layer View View CUSTOMER CUSTOMER NUMBER CUSTOMER NAME CUSTOMER CITY CUSTOMER POST CUSTOMER ST CUSTOMER ADDR CUSTOMER PHONE CUSTOMER FAX ORDER ORDER NUMBER ORDER DATE STATUS

ORDER ITEM BACKORDERED QUANTITY

ITEM ITEM NUMBER QUANTITY DESCRIPTION ORDER ITEM SHIPPED QUANTITY

SHIP DATE Co-located

Dependent Mart Virtual Marts External Dependent Mart ELT Strategic

Users TacticalUsers UsersOLAP Event-driven/Closed Loop Analytic Apps.

Dimensional View

(12)
(13)

13 > Teradata Proprietary & ConfidentialTeradata Proprietary & Confidential

“Five Stages of an Active Data Warehouse Evolution”, Stephen Brobst and Joe Rarey, Teradata Magazine Online

OPERATIONALIZING WHAT IS happening now? ACTIVATING MAKE it happen! Link to Operational Systems Automated Linkages REPORTING WHAT happened? PREDICTING WHAT WILL happen? ANALYZING WHY did it happen? Batch Reports Ad Hoc, BI Tools Predictive Models Batch Ad Hoc Analytics Continuous Update/ Short Queries Event-Based Triggering

Business is driven by need for actionable

information

Workload

Complexity

(14)

Teradata Proprietary & Confidential

Time to Build & Deploy Models Weeks to Months

Business Value

A Typical Analytic & Predictive Process

70% of the Development Process Data Data Extraction Extraction Model Model Development Development Data Data Preparation Preparation Development ADS Production ADS Data Data Extraction Extraction Model Model Deployment Deployment Data Data Understanding Understanding Business Business Understanding Understanding Time to Solution

(15)

Teradata Proprietary & Confidential

Competitive Advantage

Typically 60-80% of the Time Data Data Preparation Preparation Data Data Exploration Exploration Business Business Understanding Understanding Training ADS Data Data Gathering Gathering Operational Operational Analytic Insight Analytic Insight Time To Intelligence

A Typical Analytic & Predictive Process

Training ADS Model Model Deployment Deployment Model Model Model Model Development Development Model Model Translation Translation

(16)

Teradata Proprietary & Confidential Competitive Advantage Business Business Understanding Understanding Training ADS Operational Operational Analytic Insight Analytic Insight Time To Intelligence

Gain Agility, Speed and Quality

Model Model Deployment Deployment SAS Model SAS Model Model Model Development Development Model Model Translation Translation

(17)

Teradata Proprietary & Confidential

Current Environment Challenges

Desktops Analytic Servers Source Data Disk farms

ƒ Data volumes & complexity

continues to grow exponentially

ƒ Many tools replicate data on to

an analytic server/data mart for processing

ƒ Inefficient process: Slow

Time-to-Market

ƒ Inconsistency throughout your

(18)

Teradata Proprietary & Confidential

The Cost of A Suboptimal Environment

As data complexity and volumes grow, so does the cost of

building analytic models.

– How much time is your analyst

spending on data movement and SQL programming vs. analysis?

– How much time is model development taking?

– How much time is your IT

department spending extracting data for your analytic modelers?

– Is data movement saturating your network?

– Is your analytic server under capacity?

– How long does it take to get the results of your predictive model to your business users?

$

Data Volume/Complexity

Cost of Analytics

ROI of Analytics

(19)

Teradata Proprietary & Confidential 19

Optimal Analytic Environment

- exploits in-database processing

Provides enterprise analytic capabilities to efficiently develop and deploy advanced analytics

Delivers a flexible, scalable environment for enterprise analytics

• Performs appropriate tasks on appropriate platform

9 Data intensive tasks (exploration, preparation & scoring) in the database

• Leverages partner products integrated with the database

9 Partner products to leverage the database for data prep & scoring

9 Minimize data movement into an analytic mart or server

9 Execute model and applications directly in the database

1) Data Exploration & 2) Data Preprocessing Perform data

Intensive steps in the database

Analytic Data Set

3) Model Development

Build your model in-database for efficiency or use your favorite data mining tool

4) Model Scoring Export Code or PMML model from your

favorite data mining tool

OR

PMML

(20)

Teradata Proprietary & Confidential Aggregations, Joins, Sorts

Transformations

Data Exploration and Data Preprocessing

ƒ Know your data prior to

extracts

– Explore and identify your data relevant to your application

– Avoid pulling irrelevant tables and columns

ƒ Prepare data prior to extracts

– Perform derivations, aggregations and transformations then extract the results

ƒ Leverage in-database tools to

streamline process

– Easy to use environment for Analyst and IT team

– Automated SQL generation

– Reusable metadata in-database

Optimized Environment

Data Exploration and Preprocessing Outside the Database

Aggregations, Joins, Sorts, Transformations

(21)

Teradata Proprietary & Confidential

Analysis and Advanced Analytic Modeling

Leverage tools like SAS for its strength: Analytics

Analytic Server for Model Development

Enterprise Data Warehouse

(22)

Teradata Proprietary & Confidential

In-Database Model Deployment

Model Deployment on an analytic server

Enterprise Data Warehouse

Model Deployment In-database

ƒ

Suboptimal methods:

(1)Extract data and

score outside the database

(2) Scoring code

re-written in SQL and submitted to RDBMS

ƒ

Optimal method:

– Use automatic score

generation

capabilities of data mining tools, and run them

in-database against the Analytic Data Set

(23)

Teradata Proprietary & Confidential

Data Preparation

Transform and aggregate data in the database with Teradata ADS Generator

Model Deployment

Deploy your SAS model as a In-DBS Function in the Teradata database

• Export SAS models as Teradata functions for in-database scoring

• Eliminate labor-intensive model translation efforts

• Get to better answers faster

• Visualize your sample analytical data store

• Build analytical models more efficiently

• Produce consistent results that are “actionable”

• Leverage Teradata for merge, joins, transformations, etc.

• Reduce the size and frequency of

data movement

• Build a sharable and reusable ADS

• Know your data before extracting

• Eliminate redundancy and unused data

• Analyze large complex tables quickly

Data Exploration

Explore all the data directly in the database with Teradata Profiler

Model Development

Sample your ADS data and build your model on a server with SAS Enterprise Miner

CUSTOMER CUSTOMER NUMBER CUSTOMER NAME CUSTOMER CITY CUSTOMER POST CUSTOMER ST CUSTOMER ADDR CUSTOMER PHONE CUSTOMER FAX ORDER ORDER NUMBER ORDER D A T E STATUS

ORDER ITEM BACKORDERED QUANTITY

ITEM QUANTITY DESCRIPTION ORDER ITEM SHIPPED

QUANTITY SHIP DATE

Debt<10% of Income Debt=0%

Good Credit Risks Bad Credit Risks Good Credit Risks Yes Yes Yes NO NO NO Income>$40K In-DBS Functions Profiler

Profiler Scoring AcceleratorScoring Accelerator

ADS Generator ADS Generator

Access To Teradata Access To Teradata

Analytic

Modeler EnterpriseConsumer

Example: SAS/Teradata Partnership

Modeling ADS Analytical Model Enterprise Miner Enterprise Miner Production ADS

(24)

Teradata Proprietary & Confidential

Outcome and Benefits

Key Points

Discover Financial Services operates the Discover Card, America’s cash rewards pioneer Discover Network with millions of

merchants and cash access locations

PULSE ATM/debit network with more than 260,000 ATMs, and serving over 4,500 financial

institutions

Business Challenge

• Each analyst pulled data from 1000s of source files for analytics

• Slow and inefficient model development process required 3+ weeks to complete

• Challenges with scoring extended runtimes to 7+ days

• Inconsistent data compromised the results

• Built a 1400 variable analytical data store in Teradata

• Reduced data preparation from 3+ weeks to 90 minutes

• Model development time was reduced from 14 to 2 weeks

• Accelerated analytic model runtime from 7+ days to 36 minutes

• Trained 450 analysts on better leveraging SAS and

Teradata, with plans to increase analytic output by10-fold from 40 to 500 models

(25)

Teradata Proprietary & Confidential

Case Study: Warner Home Video

Outcome and Benefits

Key Points

Forecasting New DVD Demand

The window of opportunity is short. With a major DVD release, 45 per cent of the total video sales happen in

first week, and 75 per cent within the first month Using the combined strength

of SAS and Teradata, some smart forecasting helped Warner Home Video sell 68

million DVDs of the three Harry Potter movies, and lead the industry with sales

of $4.75B

Business Challenge

Accurately estimating the demand and shipping of new movie titles

A small window of opportunity and constantly changing

predictors presented significant challenges

New release forecasts were requiring 36 hour runtimes for approximately 300 DVD titles

Higher sales with forecasting decisions based on analytic intelligence

Using the combined strength of Teradata and SAS, the same new release forecast now takes an hour and 15 minutes

Similar performance gains were realized across other scoring activities, ie from 2 hours to 1 minute

(26)

Teradata Proprietary & Confidential

Contact Information

Dr. Frank Capobianco has many years experience as an advanced

analytics consultant for Teradata. In that role, Dr. Capobianco has helped clients solve data mining problems, such as posed by forecasting, fraud detection or marketing campaign development. His clients span the financial, insurance, telecommunications,

retail, travel, transportation, and manufacturing industries. Dr. Capobianco received his Ph.D. in mathematics from the

University of Virginia.

Email Frank at [email protected]

Tracy A. Spadola is the insurance practice lead in the Industry

Consulting Group. She has worked across multiple industries to help develop Enterprise Data Management strategies and uncover Business Value for these projects. Ms. Spadola is the current

president of the Insurance Data Management Association (IDMA).

References

Related documents

Oven Temperature Checks I. Procedure: Record the temperature of the oven using a NIST traceable thermometer. Adjust the temperature dial if necessary and recheck. Frequency:

Genocide is an ancient crime. Yet, it is a modern concept. As a political scientist and a grandson of an orphan of a genocide, I wrestle with the challenge of how to explain genocide

Effect of static radial magnetic field on natural convection heat transfer in a horizontal cylindrical annulus enclosure filled with nanofluid was investigated numerically using

Advanced Papers Foundation Certificate Trust Creation: Law and Practice Company Law and Practice Trust Administration and Accounts Trustee Investment and Financial

Although the corrosion rate of weathering steel is much lower than conventional carbon steel it cannot be totally discounted and allowance for some loss of section over the life of

Strategy #4: Review and update comprehensive park plan Action Step #1: Identify financial costs of plan components Action Step #2: Research funding options. Action Step #3:

** this refers to military/defense operations or applications (such as weapons procurement, ballistic missile technology) not to general office management such as general accounting