• No results found

Prospects for Solid State Data Storage: Beyond Flash Memory and the Hard Disk Drive

N/A
N/A
Protected

Academic year: 2021

Share "Prospects for Solid State Data Storage: Beyond Flash Memory and the Hard Disk Drive"

Copied!
29
0
0

Loading.... (view fulltext now)

Full text

(1)

© 2006 IBM Corporation

Prospects for Solid State Data

Storage: Beyond Flash Memory

and the Hard Disk Drive

Gian-Luca Bona [email protected] IBM Research,

(2)

© 2006 IBM Corporation

Incumbent Semiconductor Memories

Performance Cost NOR FLASH NAND FLASH DRAM SRAM

Attributes for universal memories:

Highest performance

Lowest active and standby power

Unlimited Read/Write endurance

Non-Volatility

Compatible to existing technologies

Continuously scalable

(3)

Incumbent Semiconductor Memories

Performance Cost NOR FLASH NAND FLASH DRAM SRAM

A new class of universal storage device :

a fast solid-state, nonvolatile RAM

enables compact, robust storage systems with solid

state reliability and significantly improved

cost-performance

SLm-1 m+1 SLm WLn-1 WLn+1 WLn
(4)

IBM © 2006 IBM Corporation SL m-1 SL m+1 SL m WL n-1 WL n+1 WL n

Non-volatile, universal semiconductor memory

ƒ

Everyone is looking for a dense (cheap) crosspoint memory.

ƒ

It is relatively easy to identify materials that show bistable hysteretic behavior

(easily distinguishable, stable on/off states).

(5)
(6)

© 2007 IBM Corporation Trap Storage Saifun NROM Tower Spansion Infineon Macronix Samsung Toshiba Spansion Macronix NEC Nano-x’tal Freescale Matsushita FLASH

Extension FRAM MRAM PCRAM RRAM

PCM - SS Electrolyte

Polymer/

Organic Mechanical 3D Thyrister

Ramtron Fujitsu STMicro TI Toshiba Infineon Samsung NEC Hitachi Rohm HP Cypress Matsushita Oki Hynix Celis Fujitsu Seiko Epson IBM Infineon Freescale Philips STMicro HP NVE Honeywell Toshiba NEC Sony Fujitsu Renesas Samsung Hynix TSMC Ovonyx BAE Intel STMicro Samsung Elpida IBM Macronix Infineon Hitachi Philips IBM Sharp Unity Spansion Samsung Axon Infineon Spansion Samsung TFE MEC Zettacore Roltronics Nanolayer Nantero STMicro Hitachi Matrix (Sandisk) 3D-ROM Samsung Macronix Infineon T-RAM Sony 4Mb C-RAM (Product) 0.25um 3.3V 512Mb PRAM (Prototype) 0.1um 1.8V 4Mb MRAM (Product) 0.18um 3.3V 2Mb FRAM (Product) 0.35um 3.3V

Memory technology remains an active focus area for the industry

IBM working towards a 16GB part by 2010

STMicroelectronics is claiming significant progress in the development of a new type of

electronic memory that could eventually replace Flash

memory technology PERFORMANCE DURABILITY High Low Volatile Nonvolatile SCM HDD Flash N/A DRAM PERFORMANCE DURABILITY High Low Volatile Nonvolatile SCM HDD Flash N/A DRAM

(7)

Critical applications are undergoing a paradigm shift

Compute-centric paradigm Data-centric paradigm

Solve differential equations

CPU / Memory

Computational Fluid Dynamics Finite Element Analysis

Multi-body Simulations``

Analyze petabytes of data

Storage & I/O

Search and Mining

Analyses of social/terrorist networks Sensor network processing

Digital media creation/transmission Environmental & economic modeling Main Focus

Bottleneck

Typical Examples

Thesis: Disks or Flash can’t keep up w/data centric applications

Proposal: Develop device technology and build a high density array and demonstrate performance and endurance for the data-centric paradigm

(8)

© 2006 IBM Corporation

What are the limitations with disks?

ƒ Bandwidth – Access Time – Reliability - Power

ƒ Disk Performance improves very slowly

Gap between processor and disk performance widens

rapidly

Bandwidth gap can be solved with many parallel disks

• but need 10,000 disks today, 500,000 disks by 2020

but that’s just for a traditional high-end HPC system

data intensive problems are much worse

Access time gap has no good solution

• disk access times decrease only 5% per year

• complex caching or task switching schemes help - sometimes

ƒ Newest disk generations are less reliable than older ones

– Data losses occur in even the best enterprise-class storage systems

(9)

HDD Issue:

Mean Time to Data Loss

1 10 100 1000 10000 100000 1000000 10000000 100000000 1000000000 1 10 100 1000 10000 100000 1000000

Number of disk drives in system

T im e t o d a ta l o ss in d ays 100GB Drive 800GB Drive 3.2TB Drive

(10)

© 2006 IBM Corporation

What are the limitations with Flash?

ƒ

Read/Write Access Times – Write endurance – Block

architecture

ƒ

Flash Performance showing no improvement

Gap between processor and Flash performance continues to widen

Write endurance <10

6

and showing no improvement trends

Need >10

9

to cater to frequent writes as data continually

flows into the system

Tomorrow’s hand-held devices will be continuously updated

Intel applications characterized by continuous data streams

Access time gap has no good solution

(11)

Storage Historic Price Trend and Forecast

34nm 4-Layer 1-bit 27nm 4-Layer 2-bit 22nm 4-Layer 2-bit 18nm 4-Layer 3-bit 14nm 4-Layer 3-bit 14nm 4-Layer 4-bit
(12)

© 2006 IBM Corporation

<5.5

Cost ($/GB)

10

-4

HER (/TB)

35%

CGR

1

Standby (mW)

100

On Power (mW)

2

MTBF (MH)

10

9

- 10

12

Endurance

100

Data Rate (MB/s)

~100-200 ns

Access Time

Storage Class Memory Target Specifications

(13)

SCM Basic Concepts (Phase Change Example)

ƒ

Using a phase transition of a

Ge-Sb-Te alloy to store a bit

ƒ

Ge-Sb-Te

exists in a stable amorphous and a stable crystalline phase

– Phases have very different electrical resistances

ƒ

Transition between phases by controlled heating/cooling

– Write ‘1’ : short (10ns) intense current pulse melts alloy crystal => amorphous – Write ‘0’ : longer (50ns) weaker current pulse re-crystalizes alloy => crystalline

– Read : short weak pulse senses resistance, but doesn’t change phase

ƒ Non-Si based proprietary diode materials being developed for high-ON current density (> 107 A/cm2 – needed for PCM) and ultra-low OFF current density (< 1

A/cm2). Temperature Tmelt Tcryst Time (ns) F Y- Address Line X- Address Line PCM Alloy & Diode Current Pulse

(14)

© 2006 IBM Corporation

ƒ

Prototype memory device

with

ultra-thin (

3nm

)

films demonstrated Dec ’06

Current scales with area

ƒ

Fast

(<100ns SET)

ƒ

Low current

(< 100μA RESET)

ƒ 3nm * 20nm Æ

60nm

2

Flash roadmap for

2013

Æ

phase-change

scales

W

defined by lithography
(15)

Processing Cost and F²

ƒ The bit cell size drives the cost of any memory

ƒ Cell area is expressed in units of F² where F is the minimum lithographic feature of the densest process layer

– Half pitch dimension of metallization connecting drain and source for ICs

– MR sensor width in magnetic recording

ƒ Cell areas – DRAM 8F² Î 6F² – NAND 4F² Î 2F² – SRAM 100F² – MRAM 20F² -- 40F² – Hard Disk 0.5F² Î 1F² – …. F F

(16)

IBM Research © 2007 IBM Corporation

Low Cost Requires High Density

Need Effective Cell Size < 4F

2

(F

Æ

Lithography Half-Pitch)

2D

Æ

Better Scaling & Fewer Process Steps, but Requires

ƒ

Interface Between Litho (F) & Sub-Litho (Fs)

ƒ

Viable Method to Manufacture Sub-Lithographic Arrays

3D Lithographic

Crosspoint Memory

4F

2

/ (# of Layers)

2D Sub-Lithographic

Crosspoint Memory

4F

2

/ (# of Nodes)

2

F

F

s

Blocks of

Nanoscale

Crossbar

Arrays

(NCA

)

(17)

Crossbar Memory Fundamentals

F

F

1-D

Cell size = 4F

2

4F

2

/n

4F

2

/n

2

2-D

standard crossbar memory

more cells at a crossbar?

Net effect: Density n

2

Cost

n

2

What if we can put

(18)

IBM Research © 2007 IBM Corporation

WL1 WL2

Fs

Fop.

Sub lithographic feature is selected by

moving depletion across the fine structure

Modulating signal is brought in by

lithographically defined lines

Fins down to sub 20 nm have been addressed

WL1 WL2

S

WL1 WL2 S S F2 Cell 1.0E-10 1.0E-09 1.0E-08 1.0E-07 1.0E-06 1.0E-05 -14 -12 -10 -8 -6 -4 -2 0 Gate 2 Vo ltag e (V) A b s o lu te C u r r e nt i n Fi n s Gate 1 Voltage = -1.2V (a) > 100X FIN 1 F IN 2 FIN 3 F IN 4 1.0E-10 1.0E-09 1.0E-08 1.0E-07 1.0E-06 1.0E-05 -14 -12 -10 -8 -6 -4 -2 0 Gate 2 Vo ltag e (V) A b s o lu te C u r r e nt i n Fi n s Gate 1 Voltage = -1.2V (a) > 100X FIN 1 F IN 2 FIN 3 F IN 4 -1.2V -10V

Micro-Nanoscale Decoder

IEDM-paper 2005
(19)

Gate1 = -2.0V Gate1 = -2.0V

Obtained Fully

Functional Devices

100nm Pitch MNAB Devices

Fabricated by E-Beam Lithography

Selectivity > 10

5
(20)

IBM Research © 2007 IBM Corporation

Combining Micro-Nano Decoder and ROM

Oxide

(3-4 nm)

4-fin UMB+ROM test structure

FIB x-SEM through gated fins (A-A’)

9

Successful integration of UMB with memory element

( 2 terminal oxide antifuse ROM)

(21)

Nanoscale Patterning Techniques

ƒ

Various nanoscale patterning techniques exist.

ƒ

Sub 20 nm pitch demonstrated.

ƒ

Only regular line / space patterns possible.

(IBM T J Watson Research Center)

Self Assembly

Nanoimprint

Lithography

(IBM Almaden)

Spacers

Frequency doubling – 40 nm to 20 nm pitch

(22)

© 2006 IBM Corporation

2F

2F

4F

2

/

n

4F

2

/ n

2

4F

2

/

L

n

2 from

4F

2 to…

demonstrated

(at IEDM 2005) if we could shrink

4F

2 by

MLC NAND Flash

is projected* to be at

43

Gb/cm2

Æ

32

GB

density product * 2006 ITRS Roadmap

97

Gb/cm2

Æ

64

GB

n

387

Gb/cm2

Æ

256

GB

n

2

1550

Gb/cm2

Æ

~1

TB

L

n

2 (for m=n=4)
(23)

Value Proposition for SCM in Storage Controllers

ƒ Significantly improved cost/performance

long before SCM competes with DASD on cost

ƒ Simple cache model (cube root rule),

queuing effects ignored, miss rate starts at 50%

ƒ Columns in figure

– 1st – business as usual (BAU), DRAM

cache @ .2% of DASD capacity

– 2nd – same cost as BAU, but DRAM

replaced by SCM, SCM @ 2% of DASD capacity, perf. ~2x

– 3rd Hierarchical storage cost now

2.8x 1st column, SCM capacity now

20% of DASD, perf. >3.6x

– 4th – cost now 1-10x, SCM only

storage used in system. Perf. >12x

ƒ Performance assessed at application

interface to OS. So, I/O stack in host, fabric latency and storage controller microcode processing time and data transfer time are included 0 500 1000 1500 2000 2500 3000 Res pons e Ti m e i n us 0.20% 2% 20% 100%

% of total storage in cache. .2% DRAM (100ns), the rest SCM @ 1us

Response Time from Simple Cache Model

SCM (1us)

Disk (5ms) and SCM (1us) Disk (5ms) and DRAM (.05us) Controller stack (100us) fabric latency and transfer (25us)

Host stack (100us)

d

2725us

1385us

763us

(24)

© 2006 IBM Corporation

SCM impact on large HPC storage systems

ƒ

Large file throughput

>>10x improvement per TB of file system possible

Limited by interconnect and controller bandwidth

Limited by file system OS software overheads

Good match for check-pointing

Bulk storage costs high

ƒ

Small file and metadata access rates

Access rate improvement >100 feasible

(25)

Magnetic Racetrack Memory

•Data stored as pattern of domains in

long nanowire or “racetrack” of

magnetic material.

•Data stored magnetically and is

non-volatile.

•Current pulses move domains along

racetrack –

no moving parts, just the

patterns move.

•Each memory location stores an

entire bit pattern

(10, 100, 1000

bits?) rather than just a single bit.

(26)

© 2006 IBM Corporation

Magnetic Race-Track Memory

IBM trench DRAM

ƒ Information stored as domain walls in vertical “race

track”

Data stored in the third dimension in tall

columns of magnetic material

ƒ Domains moved around track using nanosecond pulses

of current

ƒ 10 to 100 times the storage capacity of conventional solid state memory

Magnetic Race Track Memory

S. Parkin (IBM),US patents

(27)

Large Magnetic Anisotropy for Single Atoms

ƒ The energy that is required to change the direction of a single spin was measured.

ƒ Large single-atom magnetic

anisotropy for iron of about 6 meV.

ƒ About 50x weaker anisotropy

for manganese on same surface.

ƒ Spin excitation spectroscopy reveals spin energy levels, including their magnetic field dependence.

ƒ DFT calculations elucidate surface structure and leads to same total spin as experiment.

ƒ GOAL: engineer very large magnetic anisotropy to demonstrate data storage.

-10 -8 -6 -4 -2 0 2 4 6 8 10 0.4 0.6 0.8 1.0 Conductivity Voltage [mV]

Mn

Fe

(28)

© 2006 IBM Corporation

• Racetrack memory

– a 3-D nano-warehouse for data

The Future of Memory?

at hard-drive prices

• Atomic memory

– “there’s a lot of room at the bottom…”

• Phase-change memory

low cost because

>1 bit / 4F

2

~2013

?

~2018

?

~2030

?

Science & Technology

(29)

Innovation and Impact

ƒ

Storage class memory (SCM)

New nonvolatile solid state memory with fast access and high

throughput

Robustness, volumetric density and power significantly better than

disks

ƒ

Will revolutionize memory/storage hierarchy

>10x throughput, >100x transaction rate potential

Applications will be revamped to exploit new technology

ƒ

New applications or significantly extended applications, e.g.

Sensor/actuator systems with storage at the network edge

Mobile applications, e.g. semiautonomous video gatherers

References

Related documents

Hotel is lately reconstructed, up-to-date and comfortable, offers 50 non-smoking rooms (9 single, 6 twin, 32 double, 3 suites). 430 m) The hostel is situated in a residential

vii) Self-attested copy of Residential/Domicile certificate from competent authority in terms of clause 7 of “Prescribed format for Domicile Certificate” available in

goal join predicate θ, we have considered two measures: the number of user interactions (i.e., the number of tuples that need to be presented to the user in order to infer the

Using historical data to establish the effect of gasoline price changes on consumer vehicle choice, a predictive model has been created showing the expected switch to

The target audience will be the organizational leaders, such as the director of education, education staff, the manager of the simulation laboratory, the clinical outcomes

Expanding pre-service education for entry to the practice of Dental Assisting At present, the educational requirements for Dental Assistants in Alberta include: graduation from

Porque sólo después de haber comprendido hasta qué punto somos amados, después de haber reconocido el «don de Dios» para uno mismo y para los demás, se puede oír

With this foundation, you explore a development model that addresses the complete range of issues in the design of embedded communications software, including real-time