2
CIO Council National Conference 2015
Some Data History….
Internal Data
Structured Data
Stable Data Models
Historical Data
Edgar Frank Codd
OLTP
Databases
Operational Systems
Informational Systems
Relational
Model
1970
3
CIO Council National Conference 2015
INSTRUMENTED
INTERCONNECTED
INTELLIGENT
But The World have changed becoming more…
The resulting explosion of information creates a need for
a new kind of intelligence
4
CIO Council National Conference 2015
2+
billion
people
on the
Web by
end 2011
50 billion
RFID
tags
(1.3B in 2005)
4.6
billion
camera
phones
world
wide
100s of
millions
of GPS
enabled
devices
sold
annually
76 million
smart
meters in 2009…
200M by 2014
12+ TBs
of tweet data
every day
25+ TBs
of
log data
every day
?
TBs
of
d
ata
ev
er
y
d
ay
Of world’s data
is unstructured
80
%
5
CIO Council National Conference 2015
Da
ta V
ol
ume
Ignorance Zone Grows Everyday
6
CIO Council National Conference 2015
“
Data is the new oil.”
Clive Humby
“We have for the first time
an economy based on a
key resource [Information]
that is not only renewable,
but self-generating.
Running out of it is not a
problem,
but drowning in it
is
.”
John Naisbitt
7
CIO Council National Conference 2015
Big Data Analytics
Iterative & Exploratory
Data is the structure
IT Team
Delivers Data
On Flexible
Platform
Business
Users
Explore and
Ask Any Question
Analyze ALL Available Information
Whole population analytics
connects the dots
Traditional Analytics
Structured & Repeatable
Structure built to store data
Business
Users
Determine
Questions
IT Team
Builds System
To Answer
Known Questions
7
Available Information
Analyzed InformationCapacity constrained down sampling
of available information
Carefully cleanse a small information
before any analysis
Analyzed Information
Traditional vs Big Data Analytics
Analyze information as is & cleanse as
needed & existing repeatable
Analyzed Information
8
CIO Council National Conference 2015
Big Data Analytics
Iterative & Exploratory
Data is the structure
Traditional Analytics
Structured & Repeatable
Structure built to store data
8
Traditional vs Big Data Analytics
?
Analyzed InformationQuestion
Data
Answer
Hypothesis
Start with hypothesis
Test against selected data
Data leads the way
Explore
all
data, identify correlations
Data
Correlation
All Information
Exploration
Actionable Insight
9
CIO Council National Conference 2015
Stream Computing: New Paradigm
Real time analysis of data-in-motion
Streaming data
Stream of structured or unstructured data-in-motion
Stream Computing
Analytic operations on streaming data in real-time
Historical fact finding
with data-at-rest
Batch paradigm, pull model
Query-driven: submits queries to static data
Relies on Databases, Data Warehouses
Traditional Computing
Stream Computing
Queries Data Results Queries Data Results Queries Data Results
Queries
Data
Results10
CIO Council National Conference 2015
Big Data: Most of the Data is unstructured
•
Most of the data is unstructured o semistructured.
•
In social networks there are a lot of discussion about products and
services.
•
A lot of organizations information is kept on LOBs, description fields,
this information is never used.
•
How can I get metrics from unstructured text ?
I had an iphone, but it's dead
@JoaoVianaa.
(I've no idea where it's) !
Want
a
blackberry
now !!!
@rakonturmiami
im
moving to miami
in 3
months.
i look foward to the new lifestyle
I'm at Mickey's Irish Pub Downtown
(206 3rd St, Court Ave, Des
Moines)
w/ 2 others http://4sq.com/gbsaYR
11
CIO Council National Conference 2015
The Magic of Analyzing Unstructured Data
I like Android a lot
I am going to buy
a new pone soon
……..…..
Twitters
Magic
Analyze
Mobile Distribution
•
Obtain
the
relevant information
from the nonstructured data.
•
Eliminate noise.
•
Format
relevant information in a format that it can be:
Analyzed, Enriched, Correlated with other information.
12
CIO Council National Conference 2015
World Cup 2010
Netherlands
Striker
Arjen Robben
Keeper
Spain
Iker Casillas
Winger
Andres Iniesta
Spain
World Cup 2010
Highlights
Football
World Cup 2010
, one team distinguished well
from the rest winning the final. Early in the second
half,
Netherlands’
striker
,
Arjen Robben
, had a chance
to score, but the awesome
keeper
for
Spain
,
Iker
Casillas
made the save. Winner superiority was
reflected when
Winger
Andres Iniesta
scored for
Spain
13
CIO Council National Conference 2015
What is Big Data and What is not?
TRADITIONAL
DATA
BIG DATA
Relational Database
OLTP
ERP
…….
The Limit is not 100% clear
Social Networks
Sensor Data
14
CIO Council National Conference 2015
DATA that because of its
VOLUME,
VELOCITY or
VARIETY (format)
Is
DIFICULT
o
UNPRACTICAL
ANALYZE
With TRADITIONAL means.
15
CIO Council National Conference 2015
New Technologies and not so new
Zoo keeper
16
CIO Council National Conference 2015