• No results found

Low-Power Error Correction for Mobile Storage

N/A
N/A
Protected

Academic year: 2021

Share "Low-Power Error Correction for Mobile Storage"

Copied!
12
0
0

Loading.... (view fulltext now)

Full text

(1)

© Copyright Silicon Motion, Inc., 2013. All Rights Reserved.

Low-Power Error Correction for

Mobile Storage

Jeff Yang

P

rinci

ple Engineer

Silicon Motion

Flash Memory Summit 2014 Santa Clara, CA

(2)

Power Consumption

 The ECC engine will consume a great percentage of power in the controller

 Both RAID and LDPC become crucial to high reliable flash controller, which is

employed in embedded applications.

 Stronger correction capability

• Longer ECC chunk (e.g., 2KB BCH is stronger than 1KB, and 2KB LDPC is much stronger than 1KB)

• Higher decoder resolution and parallelism architecture will reduce the decoding latency. • Higher power consumption.

 Wider range corrupt data recovery

• Larger RAID recovery range will provide wider range data recovery. • Consume more memory size and calculation logic.

• Higher power consumption.

In normal operation…

• For RAID engine, only the encoder part is active all the time.

• LDPC uses the hard-decoding only, but the encoder is always active in each flash write action.

 Power consumption reduction in hot region

• RAID encoding • LDPC decoding

SMI Confidential

2

Flash Memory Summit 2014 Santa Clara, CA

(3)

RAID Engine

The SSD module failure rate will be the Key for TLC adoption on SSD.

Latest Planar NAND might be 1znm.

• Bit column, sense-Amp, address decoder, or charge pump corrupted • WL lining or WL to WL short.

First popular 3D structure on BiCs.

• Inherit most of failure cases from 2D.

• High aspect ratio etching and larger number of layers control will contribute more failure cases.

No Erase/Program failure, but with reading ECC failure.

• Need RAID engine to do the error recovery.

• In 3D TLC, three WL lining may cause data loss over 9 pages.

High efficiency RAID with erasure decoding.

• Encoding with high efficiency memory access. • Memory access dominates the encoding power. • Low RAID recovery trigger rate.

3

Lower grade TLC flash with RAID to have 2.6K endurance

[Ref]: VLSI symp. 2009, pp.136-137, VLSI symp. 2008, pp. 192-193 VLSI symp. 2010, pp. 131-132,

IEEE Trans. Electron Devices, vol.58, no.5, pp. 1006-1014, Apr.2011

Flash Memory Summit 2014 Santa Clara, CA

(4)

 NAND Flash characteristics on

Endurance and Retention.

 Retention capability on TLC degrades

rapidly as P/E cycle increases.

 Reduce the power with Retention-aware

algorithm:

• Predict the sensing point in the first read. • Joint the soft-decoding and the Vt-tracking. • Minimize the number of read operation

 The error bit number is the key factor of

power consumption.

• Sensing the data which has lowest bit error rate

• Predict error types and select the decoding strategy.

4

Flash Memory Summit 2014 Santa Clara, CA

(5)

© Copyright Silicon Motion, Inc., 2013. All Rights Reserved.

NAND characteristic. SLC/MLC/TLC

5 Flash Memory Summit 2014

(6)

© Copyright Silicon Motion, Inc., 2013. All Rights Reserved.

TLC Error Behavior on Endurance

6 Flash Memory Summit 2014

Santa Clara, CA

X: Number of error bit

Y: Number of ECC chunk

Endurance 2K, 90% of the error bit

number is less than 30 bits

2K: CSB-RBER = 2.7e-3

1K: CSB-RBER = 1.3e-3

• Sensing the data with lowest error bit. • Predict the error case and select the

decoding strategy.

Soft-decoding/RAID trigger rate

X: P/E cycle

After the 2.7K cycling, it needs LDPC

soft-decoding.

RAID redundant overhead ~ 0.07%.

Double strong endurance from LDPC

hard to soft decoding.

First RAID recovery occurs on 4.5K

(7)

© Copyright Silicon Motion, Inc., 2013. All Rights Reserved.

LDPC Low Power Decoding Technology

Flash Memory Summit 2014

Santa Clara, CA 7

Quantization Optimization for the Hard-decoding mode.

Change floating point calculation system Clock gated control for shift register

Smart Error prediction Retiming optimization Adaptive clipping 3A mW 2A mW A mW 3A mW Future work New decoding algo?

1.3A mW

Traditional LDPC engine

structure is here

Power efficient LDPC

engine structure

(8)

© Copyright Silicon Motion, Inc., 2013. All Rights Reserved.

1KB BCH vs. LDPC decoding power consumption

TSMC40nm on TLC flash

Flash Memory Summit 2013

Santa Clara, CA 8

Error bit range /1KB BCH LDPC NOTE

1~20 bit A mW 0.9A mW ~400MB/sec

20~40 bit B mW 1.4B mW ~400MB/sec

40~60 bit C mW 2.6 C mW ~300MB/sec

Endurance BCH LDPC NOTE

0~1K P/E A mW 0.9A mW Hard-bit only

1.~2K B mW 1.4B mW Hard-bit only

2K~5K NA 2.6C mW, poor throughput soft-decoding trigger frequently .

If requirement is 1K cycles, Will be the BCH enough

to support the TLC?

(9)

© Copyright Silicon Motion, Inc., 2013. All Rights Reserved.

After Data Retention at 500P/E

Flash Memory Summit 2013

(10)

Comparison between 1KB and 2KB

 Error bit under 2KB chunk is 110bit.

• Almost 100% correctable rate for Hard-decision only. • 851 MB/sec under 400MHz operation frequency.

 When the error bit under 1KB chunk is 60bit.

• Almost 100% correctable rate for Hard-decision only. • 330 MB/sec under 400MHz operation frequency.

 1KB vs. 2KB under the same error condition

SMI Confidential

10

1KB/60bit 2KB/110bit

Area (gate) 1A 1.8A

TP (@400MHz) 330 MB/sec 851 MB/sec Power (W) 1A 2.15A

Power Efficiency 1A MB/J 1.2A MB/J Area Efficiency

(per gate)

1A B/s 1.4A B /s

UBER=1e-15 RBER=1e-2 RBER=1.25e-2

Flash Memory Summit 2014 Santa Clara, CA

(11)

Ideal Solution for TLC-based SSD

The 2KB BCH engine only has limited improvement on endurance and retention.

LDPC ideally fits TLC and maximizes the endurance of TLC-based SSD solution.

LDPC + RAID is the ONLY reliable solution for TLC-based SSD.

11

BCH 1KB+124B, t=70bit, RBER=3.1e-2, UBER=1e-15 P/E = 2164

BCH 2KB+228B, t=121bit, RBER=3.5e-2, UBER=1e-15 P/E = 2379

LDPC 1K+124B, t=~120 RBER= 1e-2, UBER=1e-15

Flash Memory Summit 2014 Santa Clara, CA

(12)

© Copyright Silicon Motion, Inc., 2013. All Rights Reserved.

Q & A

Flash Memory Summit 2014

Santa Clara, CA 12

Disclaimer Notice

Although efforts were made to verify the completeness and accuracy of the information contained in this presentation, it is provided “as is” as of the date of this document and always subject to change.

References

Related documents

Reporting to CSO Baxter International, Member of Baxter’s Global Senior Management Team and Innovation Council.. 15 management direct reports (including

Background/Aims: We investigated the utility of serum C-reactive protein (CRP) and procalcitonin (PCT) for differentiating pulmonary tuberculosis (TB) from bacterial

• Good Flash storage is as reliable as RAID card. •

My own theory accounts for the entire set of empirical regularities in terms of two highly plausible phenemena: (1) a compensating wage differential for the higher cost of living

48.. of interviews, I realised that taking interview in their places was not always helpful as they often busy with many works and talking for a long duration, hence

This result proves how developing countries need to enhance their Trade Facilitation aspects, especially imports time in order to increase trade not only between them but also

environmental costs (outside of project scope) 25-yrs 10% (B/C) 0.29–1.03 levees, 0.7–1.34 dams, 1.1 flood diversion Structural Backward- looking and forward- looking Khan