Informatica Executive Summit Nov. 3, Copyright 2010 EMC Corporation. All rights reserved.

36 

Loading....

Loading....

Loading....

Loading....

Loading....

Full text

(1)
(2)

Managing Data Growth in the 21

st

Century: Leveraging Virtualization &

Cloud Technology

Tony Pagliarulo

(3)

Agenda

EMC Focus & Strategy

EMC IT Journey to the Private Cloud

Data Virtualization Roadmap

(4)

About EMC

Fortune 500 Rank:

166

Revenues

(2010 estimate):

>

$16.9 billion

Employees

(end Q3 2010 worldwide):

47,000

Countries where EMC does business:

> 80

Total Cash and Investments

(year to date):

$10.5 billion

Quarterly Free Cash Flow

(year to date):

$2.2 billion

Market Value

(October 2010):

>

$44 billion

(5)

EMC’s focus is

IT

Infrastructure

EMC is a

TECHNOLOGY

company

EMC’s Focus

(6)

EMC’s Complementary Strategies

Information

Virtual

Infrastructure

Infrastructure

Information Storage

Information Management

Information Protection

Information Security

Information Intelligence

(7)

EMC IT at a Glance

User Profiles

48,000 “internal” users

400,000+ customers and partners

IT Environment

5 data centers, 7 PB storage

Business Applications

400+ applications and tools

Virtualization

6,000+ OS images (worldwide)

71% of all virtualized

85% of Intel virtualized

(8)

Globalization

Business Value

Security

Performance

Functionality

Cost of Ownership

Interoperability

Manageability

We have the

same

challenges as

our

customers

(9)

2009:

0.8 Zettabytes

Source: IDC Digital Universe Study, sponsored by EMC, May 2010

GROWING

by a factor of

2020:

35.2 Zettabytes

44

(10)

IT Infrastructure today …

Complex

Inefficient

Inflexible

Costly

72%

Maintain

28%

Invest

(11)

Enter The Cloud

(12)

What is

(13)

The Cloud is . . .

Built

DIFFERENTLY

:

Dynamic pools of virtualized resources

Operated

DIFFERENTLY

:

End-to-end service delivery

Consumed

DIFFERENTLY

:

Convenient for IT and for those they support

Private Cloud

(14)

Trusted

Controlled

Reliable

Secure

Multiple Incompatible

Architectures

(15)

Dynamic

Cost-Efficient

On-Demand

Controlled

Secure

Flexible

Reliable

Trusted

Multiple Incompatible

Architectures

x86 Architecture

Homogeneous

Implications for Today’s Data Centers

(16)

Dynamic

Cost-Efficient

On-Demand

Flexible

Dynamic

Cost-Efficient

On-Demand

Trusted

Controlled

Reliable

Secure

Flexible

Trusted

Controlled

Reliable

Secure

Dynamic

Cost-Efficient

On-Demand

Flexible

Implications for Today’s Data Center

Private

Cloud

Public

Cloud

Compute

Storage

Network

(17)

Operating System

Information

Security

Federation

Virtual

Applications

Virtualization Enables Cloud Computing

The Goal: Global Workload Deployment

Private

(18)

Virtualization (vSphere)

Information

Security

Federation (vMotion + VPLEX)

Private

Cloud

Public

Cloud

EMC IT’s Cloud Strategy

(19)

Our Journey to the Private Cloud

% Virtualized

15%

30%

50%

95%

IT-as-a-Service

IT Production

Business Production

Improve agility

Lower costs

Improved quality of service

Governance

Cloud enablement

Service management

VDC

Optimization

Standardization

Virtualization

Gold Platinum

85%

We are

here

(20)

The Journey to the Private Cloud

% Virtualized

15%

30%

50%

85%

95%

IT-as-a-Service

Improve Agility

IT Production

Lower Costs

Business Production

Improve Quality of Service

Platinum

Gold

Applications

Application portfolio rationalization

Application selection

Virtualization of CIO owned applications

Infrastructure

Data center consolidation

Virtualization strategy

Virtualization factory

Governance

Establish PMO

Design and implement transformation dashboard

(21)

EMC Enterprise Information Architecture

Rapid

Prototyping

Global Data Warehouse

Enterprise Data

Subject Oriented Marts

H

R

Revenue

PI Tool

TBD…

Source Systems

Catalyst

PeopleSof

Etc…

t

SAP

Oracle 11i

End User Query Tools

BI as a Service

POC 1

POC 2

BU App 1

BU App 2

Data Integration Layer

Informatica PowerCenter

Informatica Data Services

Data Federation

Master Data

Customer

Master

(22)

Enter The Cloud

Data Virtualization

Roadmap

(23)

Guiding Principles

Application/ Database Layer

Maintain as few copies of data as possible

Master Data Management (Informatica Siperian) as single source of

truth

Informatica Data Services to enable data federation

Subset data using Informatica Applimation

Transform and replicate data if needed

Informatica PowerCenter used to feed the Global Data Warehouse and

the subject marts

Archive data

Archive database data using Informatica Applimation

Email archiving using EMC SourceOne

(24)

Guiding Principles

Storage Layer

Better utilization using storage optimization techniques

File virtualization using EMC Rainfinity

Block virtualization using EMC vPlex

Virtual provisioning

De-duplication technology

Source de-duplication using EMC Avamar

Target de-duplication using EMC Data Domain

Object technology for primary storage/backup

(25)

Data ILM- Complete Lifecycle of Data

Nearline Database

3.

Archive for Application Retirement

– archive data to on-line content

addressable storage.

Retire legacy application and eliminate application and RDBMS license and

server costs

Maintain application independent access to archived data via ODBC/JDBC

Search, browse, view archived data through Informatica Data Discovery portal

1.

Archive for Performance

– archive (relocate) production data to less expensive

and virtualized infrastructure.

Improve core application performance and operational efficiency

Lower total application infrastructure cost

Maintain seamless application access to data

2.

Archive for Compliance

– archive data to on-line content address storage.

Meet compliance requirements while reduce risk and infrastructure cost

Maintain application independent access to archived data in compressed file via

ODBC/JDBC

Search, browse, view archived data in compressed file through Informatica Data

Discovery portal

(26)

Remove Large Amounts of Data using Data

Archive and Application Retirement

EMC Management for Oracle Applications with Informatica

Archive engine relocates all data

within identified tables and

entities based on the archiving

policy definition

Decrease capital and operating

costs

by reducing storage volume

of rarely-used data

Retired application data

stored in

highly compressed immutable

file archive format

ALM

Server

File

Archive

Server

Data Discovery

BI Tools

EMC Centera

Production

Informatica Data Archive

(27)

Reduce Storage Footprint with Data Subset

EMC Management for Oracle Applications with Informatica

Review the effect of subsetting

before removing the data

Data integrity and immediate

availability

for subsetted instances

Reduce the footprint

of module

storage by 78%

Test/Development:

EMC Symmetrix DMX-4

Data Subset Filter

Useful data

Production

(28)

Oracle 11i Applications – EMC IT Use Case

Poor Performance

Infrastructure Costs

(29)

Tape Backup of Prod

90 TB

Backup of (Prod, Splx, Dev, Test, etc) onto EDL

with RAID

28 TB

3 TB - Dev, Test, Training, Perf, etc RAID

12 TB - Dev, Test, Training, Perf

5 TB - Prod, Splx, SBY, ACT, Bkup Mirror

5 TB - Prod, Splx, SBY, ACT, Bkup DR

5 TB - Prod, Splx, SBY, ACT, Bkup Mirror

5 TB - Prod, Splx, SBY, ACT, Bkup

Oracle 11i Multiplier

1TB of Data

Storage Multiplier Effect (circa 2008)

10TB

5TB

15TB

20TB

32TB

35TB

63TB

153TB

(30)

Deduplication

Data ILM Journey

2008

2009

2010

2011

Oracle 11i Multiplier Effect – 1 TB

153 TB

• Tape Backup (90)

• EDL Backup (30)

• Non Prod (12)

• Disaster

Recovery (5)

• Production (5)

64TB

• EDL Backup (30)

• Non Prod (9)

• Disaster

Recovery (5)

• Production (5)

40TB

• EDL Backup (30)

• Non Prod (9)

• Disaster

Recovery (5)

• Production (5)

20TB

• Deduplication

• Archiving

• Retirement

Decommission of 3 Envs.

Elimination of Tape Backups

Reduce Backup Retention

Archiving

(31)

Enter The Cloud

Information Mgmt

Governance

(32)

However Impact Will Be Limited Without

Enterprise Data Governance

Other processes (partner, vendor, etc)

Sustain:

Ongoing

adjustment of business

rules, data cleansing and

sourcing

3

Data Governance

2

Integrate

:

Consistent

data integration across

multiple

processes driving

enterprise-wide analytics

and insights

Lead gene-ration Lead mgt Oppty mgt Order to cash Service and support

Customer lifecycle process

Feasibility Design Qualifi-cation

General Avail-ability

End of life

Product lifecycle process

Define:

Consistent data

definitions across a

single

process (over multiple

geos and functions)

(33)

Data Governance Best Practice

Data is an enterprise

asset, and should be

governed and secured

at the enterprise level

Business ownership of

data has to come

top-down from the highest

executives

IT is a key enabler,

but not the owner

Business users are the

data stewards and

content architects

CRM

• Customer Accounts • Partner Accounts

HR

• Employee • Contractor

ERP

• Product • Item • Orders

Eng/Svc

• Product Quality • Total Customer Experience

Other

• Shadow • NDA • Personal

MDM

Business Intelligence

Role-based Access

Compliance & reporting

(34)

www.EMC.com/emcit

EMC IT Journey to the Private Cloud:

A Practitioner's Guide

(35)
(36)

Figure

Updating...