Backup and Recovery Redesign
with Deduplication
Why the move is on…
Major trends driving the transformation of
backup environments
UNABATED DATA GROWTH
Backup = 4 to 30 times production capacity
Full backups kept for months or years
Old Paradigm
20 percent resource
utilization
100% 80% 40% 0% 60% 20%CP
U Ut
ili
z
atio
n
New Paradigm
80 percent resource
utilization
100% 80% 40% 0% 60% 20%CP
U
Ut
iliz
at
ion
ESXServer HardwareShared Physical Resources
SERVER VIRTUALIZATION
Increased complexity
Virtual machine sprawl
High utilization, little bandwidth for backup
Top Server Virtualization Initiatives - 2010
Source: Enterprise Strategy Group Research Report, 2010 IT Spending Intentions Survey, January 2010
Replicate smarter
Move only deduplicated data over existing
networks for up to 99% bandwidth efficiency and
cost-effective DR.
Retain longer
Keep backups onsite longer with less disk for fast,
reliable restores, and eliminate the use of tape for
operational recovery.
Recover reliably
Continuous fault-detection and self-healing ensure
data recoverability to meet SLAs.
WAN
Retain More Backups Longer with Less Disk
Eliminate the use of tape for operational recovery
Backup
Cumulative
Estimated
Physical
Data
Logical
Reduction
Replicate Smarter Using Existing Networks
99% bandwidth efficiency
•
Move data offsite over existing networks for fastest time-to-DR readiness
•
Map method to application recovery requirements and DR policies
WAN
Deduplicated Backup
Flexible replication
On Premise
Off Premise
Recover Reliably from Disk
Highest levels of data integrity
Verification
All data is read and
verified after it is written
Backups are the data store of last resort
Improve your recovery SLAs with the advantages
of disk-based data protection
Home
DB
VM
Home
DB
VM
Self-healing
The Move is On!
(11/18/09): F1000 Sample. Wave 8, n=148; Wave 9, n=150; Wave 10, n=151; Wave 11, n=127; Wave 12, n=147; Wave 13, n=183.
*Technology was previously categorized as Deduplication. **Technology was previously categorized as Deduplication/ Capacity Optimized Storage/Single Backup Instance Store. ***Technology was previously categorized as Single Backup Instance Store Software.
Heat Index Rank: 1
Storage Networking Wave 13
EMC: The #1 Choice for Deduplication in
F1000 Companies
SOURCE: TheInfoPro Wave 13 Storage Study (11/12/09): F1000 Sample: n=183.
EMC highest in
use and
Backup/Recovery
Architecture
on premise
off premise
Conventional
(Tape-centric)
Transformational
(Disk-centric)
Backup/Media
Manager
Onsite Backup
Storage
Disaster
Recovery
Storage
Application
Backup
Clients
VTL/Tape
Tape
Tape
VM
Home
DB
Deduplication storage
Backup Software
VTL
Data Protection Management Software
Deduplication backup software and system
Backup Software
Backup Software
Backup/Recovery
Architecture
on premise
off premise
Conventional
(Tape-centric)
Transformational
(Disk-centric)
Backup/Media
Manager
Onsite Backup
Storage
Disaster
Recovery
Storage
Application
Backup
Clients
Avamar
NetWorker
Disk
Library
DL/Tape
NetWorker
Tape
Tape
Data Domain
NetWorker
Data Protection Advisor
VM
Home
DB
Retain. Replicate. Recover.
Dedupe everything without
changing anything
Never backup the
same data twice
Data Domain
Deduplication Storage Systems
Avamar
Deduplication Backup Software
Data Domain Basics
Easy integration with existing environment
Replication
CIFS, NFS,
NDMP, DD Boost
Ethernet
Virtual Tape
Library (VTL) over
Fibre Channel
DD880 appliance
Control Tier
Target Tier
Disaster Recovery Tier
4U
2 to 6 ports
10 and 1 Gigabit Ethernet; 4 Gb/s Fibre Channel
RAID 6
5.4 TB to 142.5 TB usable capacity with shelves
1 TB or 500 GB 7.2k rpm SATA HDD in shelf
File system
NVRAM
N+1 fans and redundant, hot-plug power supplies
DD880 appliance
Scalability: Data Domain Systems Trajectory
Data Domain SISL scaling architecture
Data Domain SISL Scaling Architecture: CPU-Centric
T
h
ro
u
g
h
p
u
t
GB
/s
Addressable capacity in terabytes post-RAID
(physical)
1.25
70
> PB
1.5
0.04
5
3
DD880, July 2009
Industry’s fastest backup
storage controller
Multi-controller
Data Domain Infrastructure and Ecosystem
VMware
Microsoft
Microsoft SharePoint
Oracle
SAP
Backup
Midrange and
Mainframe Partners
LaserVault
Luminex
Archive
NAS, SAN, DAS
Data Domain Systems
DD140
DD610
DD630
DD660
DD690
DD880
GDA
DDX Array
Prior Benchmark
450 GB/hr
675 GB/hr
1.1 TB/hr
2 TB/hr
2.7 TB/hr
5.4 TB/hr
86.4. TB/hr
DD Boost
Throughput
490 GB/hr
1.3 TB/hr
2.1 TB/hr
2.7 TB/hr
3.9 TB/hr
8.8 TB/hr
12.8 TB/hr
140 TB/hr
Logical Capacity
17-43 TB
75-195 TB
165-420 TB
.520-1.31 PB
.710-1.7 PB
2.8-7.1 PB
5.7-14.2 PB
45.6-114 PB
Raw Capacity
1.5 TB
Up to 6 TB
Up to 12 TB
Up to 36 TB
Up to 48 TB
Up to 192 TB
Up to 384 TB
Up to 3.07 PB
Usable Capacity
0.86 TB
Up to 3.98 TB Up to 8.4 TB
Up to 26.1 TB
Up to 35.3 TB
Up to 142.5 TB
Up to 285 TB
Up to 2.28 PB
Software Options:
DD Boost, DD VTL, DD Replicator,
DD Retention Lock and DD Encryption
DDX Array
Why EMC Data Domain
Dedupe everything without changing anything
•
Less disk to resource, less to manage
–
CPU-centric deduplication
–
Inline
–
Green
•
Simple, mature, and flexible
–
Simple, mature appliance
–
Nearline tier: any fabric, any software, backup or
nearline applications
•
Resilience and disaster recovery
–
Storage of last resort
–
Cross-site global compression: data center or
remote office
EMC Avamar
•
End-to-end, software/hardware solution
•
Improves backup window, less network load
•
Integrated high availability and reliability
•
Flexible deployment options
–
Avamar Data Store
–
Avamar Virtual Edition
–
Agent-only for ROBO
Deduplication backup solution
Avamar
VM
EMC Avamar Moves Less Data
Network
B
A
C
C
D
D
B
A
B
A
C
B
A
C
B
C
D
D
A
B
A
B
A
C
D
1. Divides data into sub-file elements
2. Determines if elements are unique or duplicate
3. Backs up only data that is unique
Avamar Primary Use Cases
NAS Backup
Virtual fulls eliminates NAS full backups
Remote Office Backup
Minimize WAN traffic, central administration
VMware Infrastructure Backup
Guest or Image backup: minimize ESX server impact
Desktop / Laptop Backup
Why EMC Avamar
•
Move less data to eliminate backup
bottlenecks
•
Always recover in one simple step
•
Lower traditional backup TCO with
typically a 12-24 month payback
Never backup the same data twice
NetWorker – Unified Data Protection
File Systems
Server
Recovery
Application
Modules
Virtual
Deduplication
Remote Sites
Deduplication
Storage Systems
Applications
Centralized Management and Control
Scheduling, Policies and Reporting
NetWorker integration with Avamar and
Data Domain
•
Integrated Avamar Client Dedupe
•
Integrated Data Domain Boost
for dedupe at storage nodes
•
Best of class deduplication
–
Unified
–
Scalable to the largest enterprises
EMC Data Protection Advisor
EMC Assessment for Deduplication
Determines amount of duplicate data in your environment
Assess
•File System
Metric
Collection
•Deduplication
Metric
Collection
Analyze
• Deduplication
Ratio
• Storage Growth
Rate
• Data Types
• Weekly Change
Rates
• Single Instance
Savings
Action
•Solution
Sizing
•TCO/ROI
generation
•Results
Presentation
and
Discussion
Why EMC for Backup and Recovery with
Deduplication
SOURCE: TheInfoPro Wave 13 Storage Study (11/12/09): F1000 Sample: n=183.