Big Data – Are You Ready?
Jorge Plascencia
Solution Architect Manager
Big Data: The Datafication Of Everything
Thoughts, Things/Devices, Processes
Run the Business: Organize data to do something specific
Change the Business: Take data as-is to figure out what it can do
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.
Continuous Innovation
Big Data at Work: Make Your Business Unique
Why Is Big Data Important?
US HEALTH CARE: Increase industry value per year by $300 B
US RETAIL: Increase net margin by 60+%
MANUFACTURING: Decrease dev., assembly costs by 50%
GLOBAL PERSONAL LOCATION DATA: Increase service provider revenue by $100 B
EUROPE PUBLIC SECTOR ADMIN: Increase industry value per year by €250 B
“In a big data world, a competitor that fails to sufficiently develop its capabilities will be left behind.” (McKinsey Global Institute)
Source: McKinsey Global Institute: Big Data – The next frontier for innovation, competition and productivity (May 2011)
What Makes it Big Data?
VOLUME, VELOCITY, VARIETY, VALUE
[Figure: data sources shown are social, blog, smart meter]
Make Better Decisions Using Big Data
Big Data in Action
ACQUIRE, ORGANIZE, ANALYZE, DECIDE
Acquire all available data
Big Data in Action: ACQUIRE
Oracle NoSQL Database
Key-value pair database
Dynamic data model
Highly scalable, available
Transparent load balancing
Built using BerkeleyDB
[Diagram: Applications use the NoSQL Driver to reach Nodes in East, West, and Central; operations shown: Read, Update, Delete]
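The key-value model and driver-side routing described on this slide can be sketched in a few lines. This is a toy in-memory illustration only, not the Oracle NoSQL Database API: the class name `ToyKeyValueStore`, its methods, and the hash-based routing rule are all assumptions made for the example.

```python
import hashlib


class ToyKeyValueStore:
    """Toy in-memory key-value store (hypothetical; not Oracle's API)."""

    def __init__(self, node_names):
        # One dict per name stands in for a group of storage nodes.
        self.nodes = {name: {} for name in node_names}
        self.names = sorted(node_names)

    def _node_for(self, key):
        # Hash the key so every caller routes it to the same node group.
        digest = hashlib.md5(key.encode("utf-8")).hexdigest()
        return self.names[int(digest, 16) % len(self.names)]

    def put(self, key, value):
        # Create or update: values need no fixed schema.
        self.nodes[self._node_for(key)][key] = value

    def get(self, key):
        return self.nodes[self._node_for(key)].get(key)

    def delete(self, key):
        # Returns True only if the key was present.
        node = self.nodes[self._node_for(key)]
        return node.pop(key, None) is not None


store = ToyKeyValueStore(["East", "West", "Central"])
store.put("user/42", {"name": "Ada"})
store.put("user/42", {"name": "Ada", "city": "London"})  # update adds a field
print(store.get("user/42"))
print(store.delete("user/42"))
```

The application never names a node; the routing function picks one from the key, which is the idea behind a driver that balances load transparently.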
Big Data in Action
Oracle NoSQL Database
Organize and distill big data using massive parallelism
Big Data in Action: ORGANIZE
Hadoop Architecture
Components: Management/Monitoring, Hadoop Distributed File System (HDFS), MapReduce
Distributed file system with redundant storage
Map/Reduce programming paradigm
Highly scalable data processing
Cost-effective model for high-volume, low-density data
A Map/Reduce Pipeline
[Diagram: Input 1 and Input 2 flow through successive MAP, SHUFFLE/SORT, and REDUCE stages to produce Output 1 and Output 2]
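The three stages in the pipeline can be illustrated with a single-process word count. This is a minimal sketch of the Map/Reduce paradigm in plain Python, not Hadoop code: the function names and the sample records are assumptions for the example, and a real cluster would run many map and reduce tasks in parallel across machines.

```python
from collections import defaultdict


def map_phase(record):
    # MAP: emit a (word, 1) pair for every word in one input record.
    return [(word.lower(), 1) for word in record.split()]


def shuffle_sort(pairs):
    # SHUFFLE/SORT: group values by key, then order the keys.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reduce_phase(key, values):
    # REDUCE: collapse all values for one key into a single result.
    return key, sum(values)


def run_pipeline(records):
    mapped = [pair for record in records for pair in map_phase(record)]
    return [reduce_phase(key, values) for key, values in shuffle_sort(mapped)]


result = run_pipeline(["big data big value", "data at work"])
print(result)
# [('at', 1), ('big', 2), ('data', 2), ('value', 1), ('work', 1)]
```

Because each map call sees only one record and each reduce call sees only one key, both stages can be spread over many nodes without coordination beyond the shuffle.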
Oracle Data Integrator
Reduces Hadoop complexities through graphical tooling
Oracle Loader for Hadoop
[Diagram: Map/Reduce pipeline over Input 1 and Input 2, with MAP, SHUFFLE/SORT, and REDUCE stages]
Big Data in Action
Oracle NoSQL Database
Oracle Enterprise Manager
Oracle Data Integrator
Oracle Loader for Hadoop
Analyze all your data, at once
Big Data in Action: ANALYZE
R Statistical Programming Language
Open source language and environment
Used for statistical computing and graphics
Strength in easily producing publication-quality graphs
Highly extensible
Oracle R Enterprise Approach
Models run in-database
Processes large data sets
Uses the power of Oracle Database 11g and Exadata
Same code, much faster
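The general idea behind running analysis in-database is to ship the computation to the data rather than pull every row to the client. The sketch below illustrates that idea only: it uses Python's built-in SQLite as a stand-in database, it is not Oracle R Enterprise, and the `readings` table and its values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (meter_id TEXT, kwh REAL)")
conn.executemany(
    "INSERT INTO readings VALUES (?, ?)",
    [("m1", 1.5), ("m1", 2.5), ("m2", 4.0)],
)

# Client-side: pull every row out of the database, then compute the means.
rows = conn.execute("SELECT meter_id, kwh FROM readings").fetchall()
by_meter = {}
for meter_id, kwh in rows:
    by_meter.setdefault(meter_id, []).append(kwh)
client_means = {m: sum(v) / len(v) for m, v in by_meter.items()}

# In-database: the engine aggregates; only one row per meter comes back.
db_means = dict(
    conn.execute("SELECT meter_id, AVG(kwh) FROM readings GROUP BY meter_id")
)

print(client_means == db_means)
```

Both paths give the same answer, but the second moves far less data as the table grows, which is the effect the slide attributes to in-database execution.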
Big Data in Action
Oracle NoSQL Database
Oracle Enterprise Manager
Oracle Data Integrator
Oracle Loader for Hadoop
Oracle R Enterprise
Decide based on real-time big data
Big Data in Action: DECIDE
Integrated Analytics Platform
One Platform, One Model, Multiple Delivery Channels
Common Enterprise Information Model
BI Server & Information Discovery
Platform features: Advanced dashboard visualization; Runs BI and EPM applications; Integrated Security, User Management, Personalization; Multidimensional Calculation and Integration Engine; Intelligent Request Generation and Optimized Data Access Services
R integration: Embed R script’s web interface in BI dashboard; graphics will stream to BI dashboard
Data sources: OLTP & ODS Systems; Data Warehouse, Datamart; Exadata; Oracle, Siebel, PeopleSoft, Custom; Excel; XML; Big Data; OLAP
Delivery channels: Ad-hoc Analysis; Interactive Dashboards; Reporting & Publishing; Detect & Alert; Mobile & Disconnected; Office Integration; Search; Embedded
Oracle Business Intelligence Enterprise Edition + Integrating R Analytics
Oracle Endeca Information Discovery
Fast & intuitive data analysis and exploration from any combination of sources, structured & unstructured
[Figure labels: Unstructured, Analytical]
Benefits:
- Unprecedented visibility of information
- Leverage existing investments in BI
- Self-service data discovery
- Best business decisions & reduced IT costs
Unique Features:
- Contextual search, navigation, analytics
- Dynamic data and metadata
- Content acquisition and text enrichment
- Memory performance
Oracle Integrated Solution Stack for Big Data
ACQUIRE: Oracle NoSQL Database, HDFS, Enterprise Applications
ORGANIZE: Hadoop (MapReduce), Oracle Loader for Hadoop, Oracle Data Integrator
ANALYZE: In-Database Analytics, Data Warehouse
DECIDE: Analytic Applications
Oracle Big Data Platform
Sample of Big Data Use Cases Today
MEDIA/ENTERTAINMENT: Viewers / advertising effectiveness
COMMUNICATIONS: Location-based advertising
EDUCATION & RESEARCH: Experiment sensor analysis
CONSUMER PACKAGED GOODS: Sentiment analysis of what’s hot, problems
HEALTH CARE: Patient sensors, monitoring, EHRs; quality of care
LIFE SCIENCES: Clinical trials; genomics
HIGH TECHNOLOGY / INDUSTRIAL MFG.: Mfg quality; warranty analysis
OIL & GAS: Drilling exploration sensor analysis
FINANCIAL SERVICES: Risk & portfolio analysis; new products
AUTOMOTIVE: Auto sensors reporting location, problems
RETAIL: Consumer sentiment; optimized marketing
LAW ENFORCEMENT & DEFENSE: Threat analysis - social media monitoring, photo analysis
TRAVEL & TRANSPORTATION: Sensor analysis for optimal traffic flows; customer sentiment
UTILITIES: Smart Meter analysis for network capacity,
ON-LINE SERVICES / SOCIAL MEDIA: People & career matching; web-site optimization
Challenged by: Data Volume, Velocity, Variety
Big Data Use Cases
Industry | Today’s Challenge | New Data | What’s Possible
Healthcare | Expensive office visits | Remote patient monitoring | Preventive care, reduced hospitalization
Manufacturing | In-person support | Product sensors | Automated diagnosis, support
Location-Based Services | Based on home zip code | Real time location data | Geo-advertising, traffic, local search
Public Sector | Standardized services | Citizen surveys | Tailored services, cost reductions
Retail | One size fits all marketing | Social media | Sentiment analysis, segmentation