Cisco Solutions for
Big Data and Analytics
Tarek Elsherif, Solutions Executive
November, 2015
• Major Drivers & Challenges
• Data Virtualization & Analytics
• Platform Considerations for Big Data & Analytics
• Cisco UCS Integrated Infrastructure for Big Data
• Q & A
Agenda
Big Data & Analytics
Major Drivers and Market
Number of Devices Growing Exponentially
2010: 12.5 Billion
2015: 25 Billion
2020: 50 Billion
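The three device counts above double every five years. As a rough check of what "exponentially" means here, the sketch below derives the implied compound annual growth rate. The function name `cagr` is an illustrative choice, not something from the slides.

```python
# Device counts as stated on the slide, in billions of connected devices.
devices = {2010: 12.5, 2015: 25.0, 2020: 50.0}

def cagr(start_value, end_value, years):
    """Compound annual growth rate between two observations."""
    return (end_value / start_value) ** (1.0 / years) - 1.0

growth_2010_2015 = cagr(devices[2010], devices[2015], 5)
growth_2015_2020 = cagr(devices[2015], devices[2020], 5)

print(f"2010-2015: {growth_2010_2015:.1%} per year")  # 14.9% per year
print(f"2015-2020: {growth_2015_2020:.1%} per year")  # 14.9% per year
```

Doubling in five years corresponds to roughly 15% growth per year, and the rate is the same in both periods, which is what a constant exponential trend looks like.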
Accelerating Transitions – Increasing Pressure
TECHNOLOGY TRANSITIONS
• Cloud
• Mobility/Video
• New Breed of Apps
• Internet of Things
• Big Data & Analytics
INCREASING BUSINESS DEMANDS
• Growth & Productivity
• New Business Models
• User Experience & Expectations
• Globalization
• Security & Regulatory Compliance
Business Opportunity
As Data Grows, Leading Businesses Use It To Drive Better Outcomes
And Beat Their Competition
Other
Businesses
Business
Leaders
Business Outcomes
Data
• Customer Profitability
• Faster Time to Market
• Cost Reduction
• Risk Management
• Compliance
• Overall Agility
Business Outcomes
Business Pain
Data Silos Proliferating, Data is Now Distributed Everywhere
Traditional Data
Sources
How Does the Business Leverage All the Data?
Cloud Data Sources
Big Data/IOE Sources
• GARTNER: Organizations that modernize their
information management capability will display
20% higher financial performance (than
themselves previously)
• HARVARD BUSINESS REVIEW OCT 2012: Data-
driven Companies are 5% more productive and
6% more profitable than competitors
• GARTNER: Through 2017, 90% of the information
assets from big data analytic efforts will be siloed
and un-leveragable across multiple business
processes.
Data Virtualization & Analytics
What is Data Analytics?
Data analytics (DA) is the science of examining raw
data with the purpose of drawing conclusions about
that information.
Analytics Classification
Descriptive Analytics: After the fact
- Dashboards
- Reports
Predictive Analytics: Future Looking
- Data Mining
- Statistical Analysis
- Predicting Future
Prescriptive Analytics: How to
- Define Symptoms
- Prescribe Instructions
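A minimal sketch of the three analytics classes applied to one small series. The sales figures, the capacity threshold, and all function names are invented for illustration; the prediction is a plain least-squares line, chosen only because it is simple.

```python
from statistics import mean

# Hypothetical monthly sales figures, invented for illustration only.
sales = [100.0, 110.0, 120.0, 130.0]

def describe(values):
    """Descriptive: summarize what already happened (report/dashboard style)."""
    return {"mean": mean(values), "min": min(values), "max": max(values)}

def predict_next(values):
    """Predictive: extrapolate one step ahead with a least-squares line."""
    n = len(values)
    xs = range(n)
    x_bar, y_bar = mean(xs), mean(values)
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, values))
             / sum((x - x_bar) ** 2 for x in xs))
    intercept = y_bar - slope * x_bar
    return intercept + slope * n

def prescribe(forecast, capacity):
    """Prescriptive: turn the forecast into a recommended action."""
    return "add capacity" if forecast > capacity else "hold"

summary = describe(sales)
forecast = predict_next(sales)
action = prescribe(forecast, capacity=135.0)
print(summary, forecast, action)
```

The same data feeds all three steps: the summary looks backward, the forecast looks forward, and the rule converts the forecast into an instruction.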
Cisco and/or its affiliates. All rights reserved. Cisco Public
What is Data Virtualization?
Data virtualization (DV) is any approach to data management that
allows an application to retrieve and manipulate data without
requiring technical details about the data, such as how it is
formatted or where it is physically located.
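The definition above can be illustrated with a small facade that hides format and location from the caller. This is a sketch of the general idea only, not the Cisco Information Server API. `VirtualDataLayer`, `csv_source`, `sqlite_source`, and the sample rows are all hypothetical.

```python
import csv
import io
import sqlite3

class VirtualDataLayer:
    """Hypothetical facade: callers ask for a logical name, not a format or location."""

    def __init__(self):
        self._sources = {}

    def register(self, name, fetch):
        # fetch is any zero-argument callable that returns a list of dict rows.
        self._sources[name] = fetch

    def rows(self, name):
        return self._sources[name]()

def csv_source(text):
    """Adapter for flat-file data held as CSV text."""
    return lambda: list(csv.DictReader(io.StringIO(text)))

def sqlite_source(conn, table):
    """Adapter for a relational table (table name is trusted in this sketch)."""
    def fetch():
        cur = conn.execute(f"SELECT * FROM {table}")
        cols = [c[0] for c in cur.description]
        return [dict(zip(cols, r)) for r in cur.fetchall()]
    return fetch

# Two sources in different formats, both invented for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer_id TEXT, total REAL)")
conn.execute("INSERT INTO orders VALUES ('c1', 250.0)")

layer = VirtualDataLayer()
layer.register("customers", csv_source("customer_id,name\nc1,Acme\n"))
layer.register("orders", sqlite_source(conn, "orders"))

# The consumer joins both sources without knowing how either one is stored.
names = {c["customer_id"]: c["name"] for c in layer.rows("customers")}
report = [(names[o["customer_id"]], o["total"]) for o in layer.rows("orders")]
print(report)
```

The consumer code at the bottom never mentions CSV or SQL, which is the property the definition describes.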
Environments: Development Environment, Runtime Server Environment, Management Environment
Data sources: Packaged Apps, RDBMS, Excel Files, Data Warehouse, OLAP Cubes, Hadoop/Big Data, XML Docs, Flat Files, Web Services
Cisco Data Virtualization Suite
Use cases:
• Business Intelligence
• Customer Experience Management
• Governance, Risk & Compliance
• Human Capital Management
• Mergers & Acquisitions
• Single View of Enterprise Data
• Supply Chain Management
• Analytics
Components:
• Cisco Information Server
• Discovery
• Studio
• Adapters
• Business Directory
• Manager
• Deployment Manager
• Monitor
• Active Cluster
Data Virtualization “Plan and Build” Services
• Cisco Plan and Build Services for Data Virtualization
• Training offers are optional components of Plan and Build Services that include:
  • Basic Training for Data Virtualization
  • Admin Training for Data Virtualization
  • Advanced Training for Data Virtualization
• Cisco Health Check Services for Data Virtualization
• Cisco Migration Services for Data Virtualization
• Cisco Data Virtualization Study
Data Virtualization “Manage” Services
• Cisco Software Application Support with Upgrades (SASU)
• Cisco Data Virtualization Optimization Service
• Cisco Mission Critical Support Services
Cisco Information Server (CIS) is
the main software product with
options that include:
• Business Directory
• Active Cluster
• Adapters
• Deployment Manager
Cisco Data Virtualization Solution Overview
Licenses include:
• Production
• Development
• Staging
• Failover/backup
Services from Cisco and Cisco Data Virtualization ATP
partners help customers accelerate the time to value of
Cisco Data Virtualization with quick deployment.
Cisco Data Virtualization
Better Business Outcomes, Faster, for Less
Business Intelligence/Analytics
Cisco Data Virtualization
• Higher Impact: Immediate Access
• More Agile: 5-10x Faster
• Less Expensive: Up to 75% Cost Savings
Platform Considerations for Big Data
& Analytics
Platform for Big Data & Analytics: What Is Important?
• Performance: As big data solutions become more critical to day-to-day decision making, high performance and availability will become table stakes.
• Deployment Speed: As big data grows, IT will need to quickly and cost-effectively scale resources.
• Scalability: Are we able to scale as fast as data?
• TCO: Solution price/performance, operations efficiencies, power consumption, and facilities footprint will become more important.
• Management: Hundreds or thousands of servers and switches will require large numbers of management tasks.
© 2015 Cisco and/or its affiliates. All rights reserved.
PSODCT-2020
Platform Considerations for Big Data & Analytics
Infrastructure
Compute (2 socket Server)
Storage (Internal DAS)
Hadoop: RAID 0 + 3-way replication
MPP DB: RAID 5 + 2-way replication
Network (dual 10Gbit)
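The two storage layouts above trade raw capacity for redundancy in different ways. The sketch below estimates usable capacity for each. The 96 TB raw figure and the 6-disk RAID 5 group size are assumptions made for illustration, and the arithmetic ignores temporary space, compression, and file system overhead.

```python
def hadoop_usable_tb(raw_tb, replication=3):
    """RAID 0 adds no parity overhead; each block is stored `replication` times."""
    return raw_tb / replication

def mpp_usable_tb(raw_tb, disks_per_raid_group, copies=2):
    """RAID 5 spends one disk per group on parity; the database keeps `copies` copies."""
    raid5_efficiency = (disks_per_raid_group - 1) / disks_per_raid_group
    return raw_tb * raid5_efficiency / copies

raw_tb = 96.0            # hypothetical raw capacity across the cluster
disks_per_raid_group = 6 # hypothetical RAID 5 group size

hadoop_tb = hadoop_usable_tb(raw_tb)
mpp_tb = mpp_usable_tb(raw_tb, disks_per_raid_group)

print(f"Hadoop (RAID 0 + 3-way replication): {hadoop_tb:.1f} TB usable")
print(f"MPP DB (RAID 5 + 2-way replication): {mpp_tb:.1f} TB usable")
```

Under these assumptions the Hadoop layout yields about one third of raw capacity, while the MPP layout yields a little over 40%, with the exact figure depending on the RAID group size.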
Infrastructure
Management:
Provision
Manage
Monitor
Complementary to data tier management
Operating System
Data Management (Hadoop, MPP DBs, NoSQL)
Analytics Data Movers Visualization Virtualization
Application
Management
Cisco UCS Integrated Infrastructure for Big Data
Cisco UCS
Integrated Infrastructure for Big Data
Integrated Infrastructure Solutions
• Integrated infrastructure is now an
industry standard term. Expected to
make up nearly 14% of all IT
infrastructure by 2016. (IDC)
• Cisco is the Leader in Integrated
Infrastructure
• The 3rd generation of the Big Data solution is
now a Cisco Integrated Infrastructure
initiative
1. IDG research
"By year-end 2015, 35% of total server shipped value will be as integrated systems."
"By 2015, converged infrastructure will represent 9.5% of the $64B services, software, and hardware markets."
Cisco UCS Integrated Infrastructure for Big Data
3rd generation of Cisco UCS Common Platform Architecture
Configurations: Big Data Starter, High Performance, Performance Optimized, Capacity Optimized, Extreme Capacity
• Industry-leading solution deployed across major industry verticals
• Areas of focus: Data Management Platforms (Hadoop, NoSQL, MPP Databases) and Analytics
• Broad ecosystem partnerships with leading ISVs
• Major Hadoop distributions in GPL
• UCS Director Express
• Pre-tested, pre-validated and documented best practice designs optimized for performance
and capacity – lowering risk and TCO
• Designed to scale from small to very large as business demands
• Unified and centralized management with seamless integration with enterprise applications
• Easy to {order, deploy, service}
Complete Infrastructure for End to End Analytics
Workloads: Parking Spot Availability, Product Information (Cached), Compliance, Recommendation Engine, Transaction Processing Application, Front End, Real-Time Price Trend Analysis, Optimization, Fraud Detection
Tiers: IOT Tier (CCTV), Mobile App, Private Cloud, Big Data, Cold Data
Network Plane, Control Plane: ACI, End-to-End Isolation
Management: UCS Central, Director, Manager, Express
Platforms:
• UCS Mini: Cassandra, Edge Processing (on-demand provisioning)
• M-Series: Apache, IIS Web Server (4x compute density, on-demand provisioning)
• UCS C3160: Hadoop Cold Data
• UCS C240 M4: Hadoop Big Data (20-30% better in all aspects but same $/performance as previous generation)
• UCS B200 M4: Oracle, SAP, MS Apps Tier (1.5x transaction processing power)
• UCS C240 M4: EDW, Greenplum, Real-Time Analytics, Hadoop, SAP HANA (EMC, NTAP)
• UCS B460: Oracle DB Tier (EMC, NTAP, Invita)
• UCS B460: SAS Analytics (6x speed-up in fraud detection by utilizing large memory)
UCS Manager
• Provides unified, embedded management of all software and hardware components
• Policy and model-based management, with service profiles, that improves agility and reduces risk
• Auto-discovery to detect, inventory, manage, and provision system components
• A comprehensive open XML API, which facilitates integration with third-party management tools
UCS Central
• Manages multiple, globally distributed Cisco UCS domains with thousands of servers from a single pane
• Provides global configuration capabilities for pools, policies, and firmware
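To give a feel for how a third-party tool might use the open XML API mentioned for UCS Manager, the sketch below builds two request documents and parses a response, with no network calls. The method names `aaaLogin` and `configResolveClass` and their attributes are recalled from Cisco's published XML API and should be verified against the programmer's guide. The helper function names, the credentials, and the sample response are hypothetical.

```python
import xml.etree.ElementTree as ET

def login_request(username, password):
    """Build the XML body for a login call (method name assumed: aaaLogin)."""
    elem = ET.Element("aaaLogin", inName=username, inPassword=password)
    return ET.tostring(elem, encoding="unicode")

def resolve_class_request(cookie, class_id):
    """Build a query for all objects of one class (assumed: configResolveClass)."""
    elem = ET.Element("configResolveClass", cookie=cookie,
                      classId=class_id, inHierarchical="false")
    return ET.tostring(elem, encoding="unicode")

def parse_dns(response_xml):
    """Extract the distinguished name (dn) of each object in a response."""
    root = ET.fromstring(response_xml)
    return [child.get("dn") for child in root.find("outConfigs")]

# Hypothetical response, hand-written for illustration; not captured from a system.
SAMPLE_RESPONSE = (
    '<configResolveClass cookie="example-cookie" response="yes" classId="computeBlade">'
    '<outConfigs>'
    '<computeBlade dn="sys/chassis-1/blade-1"/>'
    '<computeBlade dn="sys/chassis-1/blade-2"/>'
    '</outConfigs>'
    '</configResolveClass>'
)

print(login_request("admin", "example-password"))
print(resolve_class_request("example-cookie", "computeBlade"))
print(parse_dns(SAMPLE_RESPONSE))
```

In a real integration these documents would be sent to the UCS Manager endpoint over HTTPS, using the session cookie returned by the login call. That transport step is omitted here because it depends on the environment.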
End-to-end management software offers speed and
enterprise-grade reliability, while simplifying deployment and operations
Unified Management
Programmability, Scalability and Automation
Provisioning
Monitoring
Maintenance Growth
Inventory & Asset Mgmt
Fault Detection &
SW Updates
QoS Policies &
Power Capping
UCS Director
• Unified converged infrastructure management solution
• Provides programmable application containers across computing, networking, and storage resources and extends automation benefits to the entire infrastructure stack
UCS Director Express for Big Data
• End to end deployment tool for Hadoop
UCS-D Express for Big Data
End to end solution for Hadoop
End to end provisioning, installation,
and monitoring tool for Hadoop
Clusters
Better business outcomes with faster
time to value from Big Data
Provides an appliance-like experience
without the inflexibility
Centralized visibility across Hadoop
and physical infrastructure
Powerful interface for further integration
into third party tools and services
Cisco Big Data and Analytics Partner Ecosystem
Data Management Data Integration Analytics / Business Intelligence
Hadoop
Massive Parallel Processing
NoSQL
NoSQL DB