Traditional vs Big Data Analytics

(1)

CIO Council National Conference 2015

Luis Reina Juliá

@luisrei

(2)

Some Data History…

Internal Data

Structured Data

Stable Data Models

Historical Data

Edgar Frank Codd: Relational Model (1970)

OLTP

Databases

Operational Systems

Informational Systems

(3)

But the world has changed, becoming more…

INSTRUMENTED

INTERCONNECTED

INTELLIGENT

The resulting explosion of information creates a need for a new kind of intelligence.

(4)

2+ billion people on the Web by end of 2011

50 billion RFID tags (1.3B in 2005)

4.6 billion camera phones worldwide

100s of millions of GPS-enabled devices sold annually

76 million smart meters in 2009… 200M by 2014

12+ TBs of tweet data every day

25+ TBs of log data every day

? TBs of data every day

80% of the world’s data is unstructured

(5)

Data Volume

Ignorance Zone Grows Every Day

(6)

“Data is the new oil.”

Clive Humby

“We have for the first time an economy based on a key resource [Information] that is not only renewable, but self-generating. Running out of it is not a problem, but drowning in it is.”

John Naisbitt

(7)

Traditional vs Big Data Analytics

Traditional Analytics: Structured & Repeatable

Structure built to store data

Business Users Determine Questions

IT Team Builds System To Answer Known Questions

Capacity-constrained down-sampling of available information

Carefully cleanse a small amount of information before any analysis

Big Data Analytics: Iterative & Exploratory

Data is the structure

IT Team Delivers Data On Flexible Platform

Business Users Explore and Ask Any Question

Analyze ALL Available Information

Whole population analytics connects the dots

Analyze information as is & cleanse as needed & existing repeatable

[Diagram labels: Available Information, Analyzed Information]

(8)

Traditional vs Big Data Analytics

Traditional Analytics: Structured & Repeatable

Structure built to store data

Start with hypothesis

Test against selected data

[Diagram labels: Hypothesis, Question, Data, Answer, Analyzed Information]

Big Data Analytics: Iterative & Exploratory

Data is the structure

Data leads the way

Explore all data, identify correlations

[Diagram labels: Data, Exploration, Correlation, Actionable Insight, All Information]

(9)

Stream Computing: New Paradigm

Traditional Computing

Historical fact finding with data-at-rest

Batch paradigm, pull model

Query-driven: submits queries to static data

Relies on Databases, Data Warehouses

Stream Computing

Real-time analysis of data-in-motion

Streaming data: stream of structured or unstructured data-in-motion

Stream computing: analytic operations on streaming data in real time

[Diagram: Queries, Data and Results in Traditional Computing vs Stream Computing]
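
As a rough illustration of the two paradigms, the sketch below runs a one-off query over stored data and then a standing query over the same readings as they arrive. It uses plain Python generators rather than any streaming product, and the sensor readings, window size and alert factor are hypothetical.

```python
from collections import deque

# Hypothetical sensor readings; in a real system these would arrive over time.
READINGS = [21.0, 21.5, 22.0, 35.0, 22.5, 23.0]

def batch_query(stored, limit):
    """Traditional computing: data is stored first, then a query runs over it."""
    return [r for r in stored if r > limit]

def stream_alerts(source, window=3, factor=1.3):
    """Stream computing: a standing query evaluates each reading on arrival.

    Yields a reading as soon as it exceeds `factor` times the average of the
    previous `window` readings, without waiting for the full data set.
    """
    recent = deque(maxlen=window)
    for reading in source:
        if len(recent) == window and reading > factor * (sum(recent) / window):
            yield reading
        recent.append(reading)

# Pull model: the query runs once, after all data is at rest.
print(batch_query(READINGS, limit=30.0))

# Data-in-motion: results are emitted while the data is still flowing.
print(list(stream_alerts(iter(READINGS))))
```

The streaming version only keeps the last few readings in memory, which is why this style suits data that is too fast or too large to store before analyzing.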

(10)

Big Data: Most of the Data is unstructured

Most of the data is unstructured or semi-structured.

In social networks there is a lot of discussion about products and services.

A lot of organizational information is kept in LOBs and description fields; this information is never used.

How can I get metrics from unstructured text?

I had an iphone, but it's dead @JoaoVianaa. (I've no idea where it's) ! Want a blackberry now !!!

@rakonturmiami im moving to miami in 3 months. i look foward to the new lifestyle

I'm at Mickey's Irish Pub Downtown (206 3rd St, Court Ave, Des Moines) w/ 2 others http://4sq.com/gbsaYR

(11)

The Magic of Analyzing Unstructured Data

I like Android a lot

I am going to buy a new phone soon

……..…..

Twitters

Magic

Analyze

Mobile Distribution

Obtain the relevant information from the unstructured data.

Eliminate noise.

Format the relevant information so that it can be analyzed, enriched, and correlated with other information.
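
The three steps above can be sketched in a few lines of Python. This is only an illustration under simple assumptions: the brand and intent keyword lists are hypothetical, the sample messages are adapted from the slides, and a real text-analytics engine would use far richer language processing than keyword matching.

```python
import json
import re

# Hypothetical keyword lists; a real system would use richer text analytics.
BRANDS = {"android", "iphone", "blackberry"}
INTENT_WORDS = {"buy", "want"}

def to_record(text):
    """Turn one free-text message into a structured record."""
    # Eliminate noise: drop URLs and @mentions, then keep only words.
    cleaned = re.sub(r"https?://\S+|@\w+", " ", text.lower())
    words = re.findall(r"[a-z']+", cleaned)
    # Obtain the relevant information.
    brands = sorted(BRANDS.intersection(words))
    intent = any(w in INTENT_WORDS for w in words)
    # Format it so it can be analyzed and correlated with other data.
    return {"brands": brands, "purchase_intent": intent}

messages = [
    "I like Android a lot. I am going to buy a new phone soon",
    "I had an iphone, but it's dead @JoaoVianaa. Want a blackberry now !!!",
]
records = [to_record(m) for m in messages]
print(json.dumps(records, indent=2))
```

Once each message is a record with fixed fields, ordinary metrics become possible, for example counting brand mentions or joining purchase intent with sales data.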

(12)

World Cup 2010 Highlights

Football World Cup 2010: one team distinguished itself from the rest, winning the final. Early in the second half, Netherlands’ striker, Arjen Robben, had a chance to score, but the awesome keeper for Spain, Iker Casillas, made the save. The winner's superiority was reflected when winger Andres Iniesta scored for Spain.

Extracted information:

Event: World Cup 2010 (Football)

Striker: Arjen Robben (Netherlands)

Keeper: Iker Casillas (Spain)

Winger: Andres Iniesta (Spain)
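
A minimal sketch of how such role, player and team annotations might be pulled out of the text is shown below. It is not the engine behind the slide: it assumes small hypothetical dictionaries of roles and teams, a two-word capitalized pattern for player names, and a nearest-mention heuristic for assigning the team.

```python
import re

# Hypothetical dictionaries (gazetteers) of known roles and teams.
ROLES = ["striker", "keeper", "winger"]
TEAMS = ["Netherlands", "Spain"]

TEXT = (
    "Early in the second half, Netherlands' striker, Arjen Robben, had a "
    "chance to score, but the awesome keeper for Spain, Iker Casillas made "
    "the save. Winner superiority was reflected when Winger Andres Iniesta "
    "scored for Spain"
)

def extract_players(text):
    """Return (role, player, team) triples found by simple pattern matching."""
    # Record where each known team name occurs in the text.
    team_hits = [(m.start(), m.group())
                 for team in TEAMS for m in re.finditer(team, text)]
    triples = []
    for role in ROLES:
        # Role word (any case), then the first "Firstname Lastname" pair
        # that follows it within the same sentence.
        m = re.search(rf"(?i:{role})\b[^.]*?\b([A-Z][a-z]+ [A-Z][a-z]+)", text)
        if not m:
            continue
        # Heuristic: assign the team mentioned closest to the role word.
        _, team = min(team_hits, key=lambda hit: abs(hit[0] - m.start()))
        triples.append((role, m.group(1), team))
    return triples

for role, player, team in extract_players(TEXT):
    print(f"{role}: {player} ({team})")
```

Heuristics like these break easily on real text (three-word names, teams mentioned far from the player), which is why production systems rely on trained or rule-based annotators rather than a single regular expression.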

(13)

What is Big Data and What is not?

TRADITIONAL DATA: Relational Database, OLTP, ERP, …

BIG DATA: Social Networks, Sensor Data

The limit is not 100% clear

(14)

DATA that, because of its VOLUME, VELOCITY or VARIETY (format), is DIFFICULT or IMPRACTICAL to ANALYZE with TRADITIONAL means.

(15)

New Technologies and not so new

ZooKeeper

(16)

Some Conclusions

The traditional data warehouse will remain.

But analyzing traditional data alone may not be enough.

Analyzing Big Data can be a competitive advantage.
