• No results found

SPSS Modeler Integration with IBM DB2 Analytics Accelerator

N/A
N/A
Protected

Academic year: 2021

Share "SPSS Modeler Integration with IBM DB2 Analytics Accelerator"

Copied!
17
0
0

Loading.... (view fulltext now)

Full text

(1)

SPSS Modeler Integration with IBM DB2 Analytics

Accelerator

Markus Nentwig

(2)

Agenda

1 Motivation

2 Basics

IBM SPSS Modeler

IBM DB2 Analytics Accelerator (IDAA)

3 My Work

Task Overview

Fraud Prediction for Banking Scenario

(3)

New information out of old transactions!?

Example: retail business website:

Customers who bought book A also bought book X and Y

Market Basket Analysis Questions:

How does it work? What are the problems?

Possible solution to Market Basket Analysis

(4)

IBM SPSS Modeler

Data Mining workbench to discover knowledge in databases

Tool for Data Mining: IBM SPSS Modeler Scan all transactions made in past

find associations, propose them to new customers Market Basket Analysis example:

(5)

IBM DB2 Analytics Accelerator (IDAA)

Data Warehouse appliance powered by Netezza technology

System z196 connected to IDAA Accelerate specific (often analytic) queries Appliance makes it easy to install / operate

Figure from Redbook: Optimizing DB2 Queries with IBM DB2

(6)

IBM DB2 Analytics Accelerator (IDAA)

Computation with new approach on IDAA

Figure from Redbook: Optimizing DB2 Queries with IBM DB2 Analytics Accelerator for z/OS

OLAP-type access to data

Initial data loadoncefrom DB2

Pass query to IDAA

Massive Parallel Processing (MPP) on Snippet Blades

Data Mining on IDAA, less work on DB2

(7)

IBM DB2 Analytics Accelerator (IDAA)

Results with new IDAA approach

Iterate about whole data base

find associations

Netezza-based MPP architecture well suited

Use of IDAA ensures integration with DB2

transparent for customer Multiple Terabyte TransactionTable not moved anymore

Small resulting table (red) back to DB2

(8)

Task Overview

Subjects I worked on

Describe model build on IBM SPSS Modeler and possible new approach with IDAA

Find real scenarios and map them to both approaches Preparation tasks for performance test

(9)

Fraud Prediction for Banking Scenario

Real world business scenario

Prediction of possible credit card transaction fraud Examples:

Big transactions in abnormal time

Multiple purchases from different vendors in short time High risk country origin

1 Model Training: Check old transactions for fraudulent patterns

(10)

Fraud Prediction for Banking Scenario

Example: algorithm mapped to IDAA

Algorithm RFM-Analysis in IBM SPSS Modeler:

One node calculates values

No algorithm equivalent on IDAA side

Map RFM-Analysis to IDAA

(11)

Results

Model build accelerated using Netezza technology Business scenarios mapped to new architecture Performance measurement in progress

Related presentation on IOD:

IBM Software InformationOnDemand 2012 October 21-25 IDW-1626A zOLAP - Accelerate SPSS Modeling and Data Mining Using IDAA on z

(12)

Thank you!

Thank you for listening. Any questions?

IBM, the IBM logo, ibm.com and DB2 Analytics Accelerator are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others.

(13)
(14)

Preparation for performance test (1)

Data preparation

Data extraction out of the given complex scheme

We only need some tables for the model creation Adaption to the needs of DB2 / IDAA

Table creation, change of data type, date format

Enlargement of data basis (from less than 100 MB to GB-TB)

(15)

Preparation for performance test (2)

Load to DB2 and also to IDAA

DB2 LOAD utility used within a JCL script on the host Accelerate (Copy) tables to IDAA with IDAA Studio \\

LOAD

EXEC PGM=DSNUTILB,PARM=DBNI

. . .

LOAD DATA INDDN INPUTD

REPLACE

LOG NO

ENFORCE NO

FORMAT DELIMITED

INTO TABLE NENTWIG.TABLE_NAME (

PARAM TYPE,

. . .

)

(16)
(17)

Preparation for performance test (4)

Implement applied algorithms on Netezza

Much pre-defined functionality with IBM SPSS In-Database Analytics like

Discretization, normalization Decision trees, association rules Different clustering algorithms and so on Exploit and adapt to work like in SPSS Modeler Example discretization:

CALL nza..EFDISC(’outtable=RFM_BOUNDS,

intable=SOURCE, bins=5

incolumn=RECENCY_DAYS;FREQUENCY;MONETARY’);

CALL nza..APPLY_DISC(’outtable=RFM,

intable=SOURCE,

btable=RFM_BOUNDS, replace=false’);

Figure

Figure from Redbook: Optimizing DB2 Queries with IBM DB2 Analytics Accelerator for z/OS

References

Related documents

The procedure includes an Expert Modeler that automatically identifies and estimates the best-fitting ARIMA or exponential smoothing model for one or more dependent variable

The company deployed IBM® SPSS® Modeler and IBM SPSS Modeler Server software in-house – replacing third-party analytics services and enabling the company to develop more advanced

• Access SPSS Statistics graphs and reporting tools directly from the SPSS Modeler Professional interface Data preparation • Access operational data from Cognos

IBM® SPSS® Modeler Server supports integration with data mining and modeling tools that are available from database vendors, including IBM Netezza, IBM DB2 InfoSphere Warehouse,

IBM® SPSS® Modeler Server supports integration with data mining and modeling tools that are available from database vendors, including Oracle Data Miner, IBM DB2 InfoSphere

Only with SPSS Modeler Premium can you directly access text, web and survey data and integrate these additional types of data into your predictive models for more useful

• No-cost use of IBM SPSS Modeler Premium data mining and text analytics software (formerly SPSS Clementine) for classroom. teaching

IBM Netezza In-Database Analytics Transformations Geospatial Predictive Statistics Data Mining More Tools In-Database Analytics SAS R Fuzzy Logix Zementis IBM SPSS BI Tools