• No results found

Storage Class Memory and the data center of the future

N/A
N/A
Protected

Academic year: 2021

Share "Storage Class Memory and the data center of the future"

Copied!
18
0
0

Loading.... (view fulltext now)

Full text

(1)

Storage Class Memory and

the data center of the future

Rich Freitas

(2)

HPC System performance trends

y

System performance requirement has historically double

every 18 mo and this trend is likely to continue

System

Year

TF

Nodes

Cores

Memory

Storage

GB/s

Disks

Bl

P

ifi

1998

3

1464

5856

2 6 TB

43 TB

3

5040

Blue Pacific

1998

3

1464

5856

2.6 TB

43 TB

3

5040

White

2000

12

512

8192

6.2 TB

147 TB

9

8064

Purple/C

2005

100

1536

12288

32-67 TB

2000 TB

122

11000

HPCS (rough est.)

2011

4000

20000

300000

2000-4000 TB

80000 TB

4000

80000+

© 2008 IBM Corporation

(3)

Memory power trends

y

DRAM: 2X size / chip / 3 year

Growth rate lower than the requirement for system performance

q

y

p

growth

Numerically, more memory chips will be needed in a system

to match the requirement for growth in system performance

to match the requirement for growth in system performance

Memory chip power is unlikely to decrease

P = active power + leakage power + refresh power

p

g p

p

DRAM can be put in standby mode, i.e., no active power, but

refresh and leakage power are still present albeit they may be at

reduced levels

(4)

Storage power extrapolation  2020

g

Devices

1.3 M Disks

5 M Disks

Compute centric

Data centric

Disk power is power to motor

plus power for seeking and

power for interface and control

space

4500 sq.ft.

16,500 sq.ft.

power

6,000 kW

22,000 kW

power for interface and control

electronics

Motor, interface and control

power are always present

power are always present

Active power is power for

seeking and transferring

Disk total drive power may

decrease a little in the future,

Disks will get somewhat smaller

Disks will get somewhat smaller,

physically probably dropping to

1.8”

R t ti

l

d

lik l t

Largest of current HPC

systems extrapolated to 2020

© 2008 IBM Corporation

4 GSRC Workshop August 4,2008 Rich Freitas

Rotational speed unlikely to

decrease

(5)

Net of memory and disk situation

y

Memory power will increase significantly over the

next few years

Storage power will increase significantly over the

t f

next few years

Data center power and cooling capabilities

unlikely to increase significantly over the same

unlikely to increase significantly over the same

interval

(6)

Definition of Storage Class Memory SCM

g

y

A new class of data storage/memory devices

many technologies compete to be the ‘best’ SCM

many technologies compete to be the best SCM

SCM features:

Non olatile (

10 ears)

Non-volatile ( ~ 10 years)

Fast Access times (~ DRAM like )

Low cost per bit more (DISK like

by 2015)

Low cost per bit more (DISK like – by 2015)

Solid state, no moving parts

SCM blurs the distinction between

SCM blurs the distinction between

MEMORY (fast, expensive, volatile ) and

STORAGE (slow cheap non volatile)

© 2008 IBM Corporation

6 GSRC Workshop August 4,2008 Rich Freitas

(7)

Phase-change

RAM

PCRAM

Bit-line

RAM

PCRAM

“programmable

resistor”

Access device

(transistor, diode)

Word

-line

temperature

T

melt

“RESET” pulse

Voltage

Potential headache:

High power/current

 affects scaling!

T

cryst

“SET”

p lse

Potential headache:

time

pulse

If crystallization is slow

(8)

Storage Class Memory

A solid-state memory that blurs the boundaries

between storage and memory by being

low-cost, fast, and non-volatile.

Speed

Storage-type

uses

Memory-type

uses

(Write)

Endurance

Cost/bit

Power!

Endurance

 SCM system requirements for

Memory

(Storage)

apps

• No more than 3-5x the Cost of enterprise HDD

(< $1 per GB in 2012)

<200nsec

(<1

sec)

Read/Write/Erase time

• >100,000 Read I/O operations per second

>1GB/sec

(>100MB/sec)

• Lifetime of

10

8

10

12

write/erase cycles

© 2008 IBM Corporation

8 GSRC Workshop August 4,2008 Rich Freitas

B-19

(9)

Memory/Storage Stack Latency Problem

Get data from TAPE (40s)

10

9

10

10

Century

Access DISK (5ms)

Storage

10

8

10

7

10

e

Access DISK (5ms)

n

s

10

5

10

6

m

an Scal

e

ime in

n

10

3

10

4

Access FLASH (20 us)

SCM

Hu

m

Get data from L2 cache (10ns)

Memory

Get data from DRAM or PCM (60ns)

T

i

10

2

10

Access PCM (100 – 1000 ns

)

CPU operations (1ns)

Get data from L2 cache (10ns)

y

10

1

(10)

SCM

If you could have SCM, why would you need anything else?

$1M / GB

SCM

$10k / GB

$100k / GB

NAND

$100 / GB

$1k / GB

DRAM

$1 / GB

Chart courtesy of

Dr. Chung Lam

IBM Research

To be published

$10 / GB

Desktop

HDD

p

IBM Journal R&D

$0.10 / GB

$0.01 / GB

SCM

© 2008 IBM Corporation

10 GSRC Workshop August 4,2008 Rich Freitas

$

(11)

SCM in a large System

Archival

CPU

RAM

TAPE

1980

Archival

Active Storage

Memory

Logic

CPU

RAM

DISK

TAPE

1980

CPU

RAM

FLASH

DISK

TAPE

SSD

2008

DISK

SSD

CPU

SCM

RAM

2013

...

DISK

TAPE

(12)

Active vs passive power

The blue area marks active power in the

power equations

Th

d

k

i

i

P

V

I

V

I

CV

2

f

The red area marks passive power in

the power equations

Passive power is unproductive. It just

causes heat

f

CV

I

V

I

V

P

M dd dd dd leak

refresh

V

I

V

I

r

d

P

D

4

.

6

2

.

8

i

&

c

s

&

t

causes heat

For memories it is leakage and refresh

power, which is typically smaller than

maximum active power

D

i

&

c

s

&

t

dd

dd

PCM

I

V

I

V

P

standby

active

For disks it is keeping the motor spinning

and the standby power of the

electronics, which is typically larger than

the maximum active power

passive

active

productive and active is device that the time of portion the is 

p

For PCM is is the leakage and small

standby power and is typically much

much smaller than the maximum active

power

motor disk the of power normalized the is  © 2008 IBM Corporation

12 GSRC Workshop August 4,2008 Rich Freitas

(13)

CPU’s are doing fine - focus on memory/storage stack

g

y

g

Issues

Opportunities

Goal: eliminate passive power and make active power more efficient

Redesign of DRAM and Disks to

eliminate passive power

Possible but not probable

DRAM has fast turn off and turn on

How can PCM be used to virtually

eliminate passive power?

–Active power is much greater than

passive power

times, but it is volatile

–Turning off DRAM when not active

causes data loss

Disks are nonvolatile and turn

–Turn on/off time ~50us

How can data be laid out to minimize

active power?

–Memory/storage pools

on/off in ~ 20-30 seconds

–On/off time to long for practical

active storage systems

–Storage systems that manage

Memory/storage pools

–hierarchy

How can active power be used more

efficiently?

g

y

g

power in this manner are called

MAID system.

–So far, only used for archive

systems

–Device design

–System architecture

–exploit virtualization (management

challenge)

(14)

Questions

© 2008 IBM Corporation

(15)

History of Phase-change memory

History of Phase change memory

y

g

y

• late 1960’s – Ovshinsky shows reversible electrical switching in

disordered semiconductors

• early 1970’s – much research on mechanisms, but everything was too slow!

ty

Crystalline phase

Low resistance

High reflectivity

n

ductivi

rical co

n

Amorphous phase

High resistance

Low reflectivity

Elect

r

Low reflectivity

© 2008 IBM Corporation

15 GSRC Workshop August 4,2008 Rich Freitas

[Neale:2001]

(16)

IBM Almaden Research center

average and peak disk storage power consumption

30

40

Wa

20

30

tts

0

10

W

a

tts

Standby Not

Spinning

Idle

Spinning

Typical R/W

Operation

Spinup

Peak

3.5" 15K RPM FC/SAS

3.5" 10K RPM FC/SAS

3.5" 7200 RPM SATA

2.5" 10K RPM SAS

2.5" Mobile 7200 RPM SATA

2.5" Mobile 5400 RPM SATA

USB Flash Disk

© 2008 IBM Corporation

16 GSRC Workshop August 4,2008 Rich Freitas

Note: Data for disks from datasheets/specs on Seagate website

3.5” FC/SAS disks are 300MB capacity, 3.5” 7200 RPM SATA disk is 500 GB

(17)

SCM Design Triangle

g

g

Speed

Memory-type

Uses

Storage-type

Uses

(Write) Endurance

Cost/bit

(18)

Energy & Infrastructure Cost exceed Server Cost *

Energy & Infrastructure Cost exceed Server Cost

* for cheap

for cheap

1U Servers

1:1

© 2008 IBM Corporation

18 GSRC Workshop August 4,2008 Rich Freitas

References

Related documents

In order to achieve homogeneous glass products the loading of the simulant Magnox sludge in the HM09 glass composition needs to be 30 wt% or lower.. Excess loading results in the

BA (0.5-3.0 mg/L) in combination of auxins like NAA, IAA and IBA (0.5 mg/L) were used to produce the direct shoot regeneration from young leaf of in vitro regenerated Solanum..

In terms of tablet offerings, both operators offer a range of 5 tablets, out of which 3 are LTE-enabled (the non-LTE models being the Apple iPad and another Android model),

Pronivo ™ k creates a small ramp between tile and floor coverings at lower elevations or finished concrete, providing a sloped surface that eliminates trip hazards and

There are too many people saying things like, &#34;We can't send our children to college because Jesus is coming.&#34; &#34;We can't get married now because Jesus is

When designed intentionally and used appropriately, a data-driven revenue cycle can overcome common roadblocks to performance, build and sustain margins, improve performance,

An hypothetical mechanism for the synergist effect between A2V- and FGK45-based therapies is proposed in Figure 20, explained by the creation of an

By starting to expand Contemporary dance discourse to include more world views, epistemologies, and different versions of Modern dance history, not only will critical discourse