1 © Copyright 2006 EMC Corporation. All rights reserved.
Active Archiving
EMC Czech Republic Lipavsky_pavel@emc.com
2 © Copyright 2006 EMC Corporation. All rights reserved.
Broadest Range of Function, Performance, Media, and Connectivity
PATA 320 GB 5,400 rpm SATA 160 GB 7,200 rpm SATA 250 GB 7,200 rpm Fibre Channel 73 GB 10k/15k rpm Fibre Channel 300 GB 10k rpm Fibre Channel 146 GB 10k/15k rpm SATA 500 GB 7,200 rpm Low-cost Fibre Channel 500 GB 7,200 rpm iSCSI ChannelFibre IP FICON SAN NAS CAS
CO N T R O L CE N T E R
EMC’s Tiered Storage Offering
Invista Connectrix Symmetrix Celerra CLARiiON ADIC Scalar Family Centera CX700 CLARiiON Disk Library CX500 CX300 AX150 NSX NS Series/ Integrated Rainfinity Global File Virtualization iSCSI Centera 4-Node DMX1000 DMX-3 DMX800 NS Series/ Gateway
3 © Copyright 2006 EMC Corporation. All rights reserved.
Active Archiving solves the data explosion
ERP
CRM
EMC CenteraFileserving
4 © Copyright 2006 EMC Corporation. All rights reserved.Data classification
FILE Share NO N B u s in e s s d a ta Data storage *. p p t *. d o c *. x ls <30 days FC2 15k RPM HDD, RAID 1030-90 days 20% ATA RAID-5, NAS (IP Storage)
90+ dasy Archiving storage
20% 50% Data duplication (20%) B u s in e s s d a ta 10%
5 © Copyright 2006 EMC Corporation. All rights reserved.
VisualSRM Manageable Resources
VisualSRM LAN DAS Sun Solaris HP-UX Linux
Microsoft Windows NT, 2000, Cluster
NAS Exchange NetWare IBM AIX SQL Server Sybase Oracle SAN Sw itche s 6 © Copyright 2006 EMC Corporation. All rights reserved.
VisualSRM File-Level Policies
Intelligent Actions include: –Clean temp files
–Data manipulation
–Stage, move, delete, or compress data; run external scripts or programs
–Interactive (requires approval) or automatic
Policy task list: Series of related policies run in sequential order
Scans database, mail server, files, event logs
Provides consolidated and detailed reports
Standard reports; summary to detail
Easy-to-add custom reports Multiple report formats and delivery options (report jobs) Trending (capacity planning) –Partition, system, user, file type,
logical group, etc. Chargeback
7 © Copyright 2006 EMC Corporation. All rights reserved.
Proč archivovat?
7
Souborový server (FileShare)
1.
Nároky na diskovou kapacitu
2.
Dlouhý backup/restore
3.
Duplicita dat na disku / opakovaně zálohovaná data
Nepoužívaná data Občas používaná data Aktivní data
51% veškerých dat jsou nepotřebná, duplicitní nebo soukromá
68% dat nebyla použita za posledních 90 dnů nebo více
8 © Copyright 2006 EMC Corporation. All rights reserved.
Jak na to?
8 Nepoužívaná data Občas používaná data Aktivní dataCLARiiON CX
FC2 HDD 15k RPM RAID10
CLARiiON CX
FC2 HDD 10k RPM RAID5
Cellera/NAS, iSCSI
Archiv / Centera
DiskXTender
HSM
9 © Copyright 2006 EMC Corporation. All rights reserved.
DiskXtender
How DiskXtender Works
Filesystems are scanned and files matching policies are identified
Open systems storage platforms
CAS
RAID / SCSI ATA Tape Optical
Files can be moved to single or multiple targets, or across multiple tiers
UNIX, Linux, Windows, NAS filesystems
Policies classify files: Name, Size, Type, Last Access, Last Modified…
View is always local, and access is transparent to user or applications
10 © Copyright 2006 EMC Corporation. All rights reserved.
Key DiskXtender Features and Benefits
File migration can be triggered on storage-capacity usage, preventing application-performance degradation
Purge rules
Solutions for UNIX, Linux, Windows, and NAS
Broad platform and filesystem support
Supports disk, CAS, tape, and optical
Broad storage-device support
Integrated with Legato NetWorker and third-party backup applications
Integrated backup solutions
Can migrate to single targets or multiple targets or across multi-tier storage infrastructures
Multi-target and multi-tier
Fast recalls; no network dependencies
Native filesystem implementation
Flexibility to set policies for identifying files as migration candidates
Move files based on inactivity or for compliance
Powerful policy engine
Benefit Feature
11 © Copyright 2006 EMC Corporation. All rights reserved.
E-Mail Archiving with Centera and EMC MailXtender
Centera Business Impact
Transparent archiving of mails, no user action-activity needed
Efficient storage eliminates duplicates; meets regulatory requirements
Content authenticity and retention
Low
administration—configuring, managing, and self-healing Improved accessibility Massively scalable Mail Servers Clients E-Mail Archiving Application Centera
Active
Archive
L A N Centera API Active Mailboxes E-Mail Application Archived Mailboxes Inbox Outbox Sent SAN 12 © Copyright 2006 EMC Corporation. All rights reserved.User Access & Administrative Tools
Powerful Search and
Retrieval Capabilities
Access to Full-text Index
– Created of all messages and ATTACHMENTS
Users Retrieve Their own
Messages
An Administrator-Run Restore
from Backup is No Longer
Necessary!
User
Performs mail box activity
analysis to assist in
understanding system usage
patterns
– Top Sources – Monthly Storage Use – Attachment File Types
Creates Reports and Graphs
for Easy Assimilation of
Information
Locate and Delete
– Improper Emails – Virus Infected Messages
13 © Copyright 2006 EMC Corporation. All rights reserved.
Volumes
Archive structure
Mails will be archived into Container-Files.
– Removes NTFS file issue
Windows requires disk partition every 5mil NTFS file items
LEGATO groups into containers of up to 10,000 messages
Optimized to storage medium
– Tape medium require large files to minimize mechanical damage – Devices such as Centera have an
optimum
Optimized for network usage
– EX direct reads from container file – irrespective of tiers of storage
Improved back up and restore
– Container based on Categories. – Not File structure
Seamlessly integrated with DiskXtender
EmailXtender Server Full Text Index SQL 2000 Database 14 © Copyright 2006 EMC Corporation. All rights reserved.
How Does EmailXtender Archive E-mail?
Capture •In real time •Unique ID •De-duplication •Archive task Archive •Secure •Compression 2:1 •Container files •Tiered storage Organize and Classify • Exclusion-collection rules •Retention and disposal
E-mails on messaging server are now
“convenience copies”
Index •Messages and attachments •Embedded messages Messaging Servers EmailXtender ServerEmailXtender Search and
Discover •User searches
from plug-in or Web •Administrator
search for legal discovery Administer and Audit •Administration of a single central archive •Tools, reports, and other diagnostics
15 © Copyright 2006 EMC Corporation. All rights reserved.
Messaging Environments Supported by EmailXtender
Microsoft Exchange
–Exchange 5.5 SP 4 –Exchange 2000 SP 3 –Exchange 2003Bloomberg Messaging
Instant Messaging
–AOL –MSN –Sametime –OthersLotus Domino
–Domino 5.x–6.5.xWindows 2000 Server • IBM AIX V4.3.3, V5.1, V5.2 Windows 2003 Server • IBM AS/400 V5R1, V5R2, V5R3 Sun Solaris 8 (Solaris 5.8) • Red Hat Enterprise Linux V2.1 or V3.0
or 9 (Solaris 5.9)
UNIX SendMail
15
16 © Copyright 2006 EMC Corporation. All rights reserved.
Move inactive data off primary storage—save money in production storage and server resources
Reduce backup needs and streamline information management
Content authenticity Leveraging of ILM and consolidation Universal access Maximize storage efficiency Enterprise scalability
Fast, online standards-based access to content
Anywhere, any time, any application, from any platform
Self-management means no overhead Identical objects are stored only once
Massively scalable to petabytes of storage and billions of content items, serving multiple applications simultaneously
Future-proof architecture
Assures content has not been modified
Where needed, addresses governance and regulatory requirements
Effective Archiving Must Provide…
17 © Copyright 2006 EMC Corporation. All rights reserved.
Database stores Content Address for future reference
Centera performs Content Address calculation and
sends address back to application
Application server sends object to Centera over IP network Object is created
and sent to application server
How Centera Works
LAN CACA CA CA
Content Addressing
10001010 Digital fingerprint Globally unique Location-independent Content Address algorithm Content Address algorithm 10111011 18 © Copyright 2006 EMC Corporation. All rights reserved.Universal Access Makes Archiving Simple and Easy
Centera
Anywhere, any time, any application from virtually any platform
Centera API
NFS
CIFS
FTP
HTTP
19 © Copyright 2006 EMC Corporation. All rights reserved.
Centera Virtual Pooling Technology
CDF Blob
Pool 1
Application Pool 1
Pool 2
Application Pool 2
Pool 3
Application Pool 3
Default Pool
Default Pool
20 © Copyright 2006 EMC Corporation. All rights reserved.
Centera Architecture
Centera Node
Designated as “access” or “storage” node 2 GHz P4 processor
512 MB DDR RAM Four x 320 GB ATA
Two 10–100 BT network-interface cards (NICs) One Gigabit Ethernet NIC
Centera Network Dual 48-port cube switches
Gigabit Ethernet connections to facilitate additional racks
Centera Cabinet No single point of failure Dual AC power
Half-nodes / switches are on one AC rail Data is mirrored
Extreme Scalability Massive parallel processing
Add storage: processing power, memory, bandwidth
32-node CAB
Multiple cabinets in a single cluster Four-node
CAB
21 © Copyright 2006 EMC Corporation. All rights reserved.
Content Mirroring
Self-managed private LAN Storage nodes Switch Access nodes Switch SwitchPower rails ATS
22 © Copyright 2006 EMC Corporation. All rights reserved.
Self-Healing
Self-managed private LAN Storage nodes Switch Access nodes Switch SwitchPower rails ATS
23 © Copyright 2006 EMC Corporation. All rights reserved.
EMC RepliStor
24 © Copyright 2006 EMC Corporation. All rights reserved.
RepliStor Protects the Business
Replicate data continuously to one or
more servers, regardless of distance
Ensure data can be available
immediately after a fault
Replicate open and closed files
Replicate data from a Microsoft Cluster
Ensure SQL Server 2005 data
recoverability with
application-consistent VSS snapshot
IP network
Site A or
Server A Site B orServer B
Asy nch
ronous
Unlimited distance
On a Server Fault…
RepliStor restarts the application,
reconnects the data, and reconnects
the users at the secondary location
RepliStor RepliStor
25 © Copyright 2006 EMC Corporation. All rights reserved.
What About Monitoring Application Faults?
In the event of a site disaster, AutoStart would relocate application resources from Site A to Site B:
–Ensure read-write access to remote replica –Start application resources
–Start RepliStor replication from Site B to Site A EMC AutoStart manages automated
application and data restart on an alternate server, either local or remote AutoStart seamlessly transfers control
of application and storage resources to a local or remote facility in the event of an outage
AutoStart provides functionality to assist in automating failback of services, applications, and data AutoStart manages RepliStor remote
replication as a data source
Site A Site B
Unlimited distance
RepliStor
AutoStart AutoStart