Data deduplication is more than just a
BUZZ word
Per Larsen
DATA
BUDGET
RECOVERY &
DATACENTER
Mr. Hansen
•
More Storage
•
Longer Backups
•
Smaller Budgets
•
Less Staff
•
Fewer Projects
•
Missed SLAs
•
Wasted IT time
•
Legal Fee & Fines
Mr. Jensen
What does “dedupe–rate” mean?
What data can be deduplicated?
Where to run deduplication?
What is Deduplication?
• How does it work?
• What is the difference between file and block level?
4
What Does “dedupe–rate” Mean?
• Data deduplication ratio over a particular time period
What Does “dedupe–rate” Mean?
Space Reduction Ratio
Space Reduction Percentage
2:1
1/2 = 50%
5:1
4/5 = 80%
10:1
9/10 = 90%
20:1
19/20 = 95%
• Comparing ratios is problematic
–
Broad set of assumptions implicit in their calculation
100:1
99/100 = 99%
NetBackup to Target Dedupe Devices
•
Deduplicate at the appliance
•
Benefits include:
–
Easy setup, no change to backup environment
–
Centralized policy management and replication
control
Target Dedupe Devices
Client
control
•
Disadvantages:
–
Workload of a traditional backup until backup
data hits appliance — no infrastructure savings
–
Cost
= Deduplication Engine
OST Appliance
OST
NetBackup Deduplication at the Media Server
•
Deduplicate at the media
•
Benefits include:
–
Duplicate data eliminated at Media Server
–
Built-In to NetBackup
–
No client impact
Media Server
Client–
No client impact
•
Disadvantages
–
Workload of a traditional backup until data
reaches Media Server
–
Additional CPU load on Media Server
= Deduplication Engine
Media Server
NetBackup Deduplication at the Source
•
Deduplicated at the source/client
•
Benefits include:
–
Built-In to NetBackup
–
Reduced WAN/LAN bandwidth impact
–
Comprehensive application and platform
support
Client - Source
Client
–
Comprehensive application and platform
support
–
Ideal for most application and file/folder
backups
•
Disadvantages
–
May not be ideal for datasets with high
change rate
–
Additional load on the Client
= Deduplication Engine
Symantec
Backup Deduplication
“Closer to the Source” is Best
• Example: Moving 1TB of Data w/ 90% Dedupe Potential
Target
Dedupe
Target Deduplication Appliance Storage Target Deduplication Appliance NetBackup Media ServerGood
Transfer 1TB Transfer 1TB Transfer 0.1TBOST
Symantec Off Host Dedupe
What is NetBackup Appliances?
•
Symantec provides
Complete
•
Single vendor for
Easy
•
32TB usable dedupe
Scalable
•
Disk based solution,
Reliable
An Appliance from Symantec that Allows for a Simplified
and Faster Deployment of NetBackup Deduplication and
New NetBackup Media Servers
•
Symantec provides
software, hardware,
and support
•
Complete dedupe
solution—source or
target
•
Single vendor for
support and service
•
Easy ordering and
deployment
•
32TB usable dedupe
capacity per node
•
192TB usable global
dedupe capacity per
setup
•
Disk based solution,
Raid 6 configuration
•
Redundant fans,
power supply
NetBackup 5000 / 5020 Appliance
PDDO in a box
Standard NBU Clients
NBU Media Server(s)
NetBackup 5200 / 5220 Appliance
Media Server in a box
Standard NBU Clients
NBU 5200
Media Server
Standard NBU Clients
Data Protection Challenges from a Disaster perspective
Architecture Model – Remote/Branch Office backup
NBU Clients
Site B
NBU Media Server
Media server at remote site
=
Faster local recovery
Tape
NBU Media Server
Main DC site Location with Centralized
Administration Site A
Replicate to
the DC
Centralize and simplify backup management
Site C NBU Clients DR Site (Replication) NBU Media Server
What Sets NetBackup Deduplication Apart…
Built-In, easy to use
Works out of the box. No
additional setup required
No expensive hardware
Use commodity hardware of
choice
Lower TCO than Appliances
Up to 55% less than appliance
Proven Product Line, Proven
Technology
Built into NetBackup, based on
PureDisk Dedupe Technology
Investment Protection
Dedupe value wherever
needed-Source, Media Server, Appliance
Lower TCO than Appliances
Up to 55% less than appliance
Backup Re-architecture—Strategic Objectives
—
Eliminate tape as a transported media
—
Establish ‘always on’ WAN connectivity to recovery sites
—
Improve RTO/RPO—narrow the gap of tier 1 disk replication
—
Provide ‘always ready’ disaster recovery infrastructure
—
Provide operational consistency even in case of disasters
The Three Key Facilitators…
—
Data Deduplication technology coupled with highly efficient Data
Replication
—
3-site DR model – Primary, Secondary/Recovery, Data Bunker
Site B
Solution implementation plan
Backup Copy 3
Backup Copy 4
Short retention on disk
Direct tape creation
Long retention on tape
No tape vaulting
Opt-dupe Replication
Opt-dupe
Replication
Media server Media server Storage Pool sSite A
Site C - ‘bunker’
Summary - One size does not fit all…
• Customers often need more than 1 approach to deduplication
–
Client / Source Deduplication
–
Media Server Deduplication (Inline)
–
Target Appliance Deduplication (Post process)
• Each use case is different
• Each customers data is different
• Each customers data is different
• Each customers approach towards Data Protection is different
The Symantec deduplication strategy:
Thank you!
Thank you!
Thank you!
Copyright © 2010 Symantec Corporation. All rights reserved. Symantec and the Symantec Logo are trademarks or registered trademarks of Symantec Corporation or its affiliates in
the U.S. and other countries. Other names may be trademarks of their respective owners.
This document is provided for informational purposes only and is not intended as advertising. All warranties relating to the information in this document, either express or implied, are disclaimed to the maximum extent allowed by law. The information in this document is subject to change without notice.