• No results found

21-1

N/A
N/A
Protected

Academic year: 2021

Share "21-1"

Copied!
50
0
0

Loading.... (view fulltext now)

Full text

(1)

4 40 0 660 0 880 0 11000 0 112200 40 40 60 60 80 80 mm mm

21. Design for Testability

21. Design for Testability

J. A. Abraham J. A. Abraham

Department of Electrical and Computer Engineering

Department of Electrical and Computer Engineering

The University of Texas at Austin

The University of Texas at Austin

EE 382M.7 – VLSI I

EE 382M.7 – VLSI I

Fall 2011

Fall 2011

November 9, 2011 November 9, 2011

(2)

4 40 0 660 0 880 0 11000 0 112200 40 40 60 60 80 80 mm mm

Design for Testability (DFT)

Design for Testability (DFT)

Reduce costs associated with testing complex circuit Reduce costs associated with testing complex circuit Design circuit so that it will be easier to test

Design circuit so that it will be easier to test Increase

Increase accessibilityaccessibility of internal nodesof internal nodes

Controllability:

Controllability: ability to establish specific signal value atability to establish specific signal value at each internal node by setting inputs

each internal node by setting inputs Observability:

Observability: ability to determine internal values byability to determine internal values by controlling inputs and observing outputs

controlling inputs and observing outputs

Ensure

Ensure predictablepredictable circuit responsescircuit responses Tradeoffs

Tradeoffs

Technical: area, I/O pins, performance Technical: area, I/O pins, performance

Economic: design time, yield, time to revenue Economic: design time, yield, time to revenue

(3)

4 40 0 660 0 880 0 11000 0 112200 40 40 60 60 80 80 mm mm

Design for Testability (DFT)

Design for Testability (DFT)

Reduce costs associated with testing complex circuit Reduce costs associated with testing complex circuit Design circuit so that it will be easier to test

Design circuit so that it will be easier to test Increase

Increase accessibilityaccessibility of internal nodesof internal nodes

Controllability:

Controllability: ability to establish specific signal value atability to establish specific signal value at each internal node by setting inputs

each internal node by setting inputs Observability:

Observability: ability to determine internal values byability to determine internal values by controlling inputs and observing outputs

controlling inputs and observing outputs

Ensure

Ensure predictablepredictable circuit responsescircuit responses Tradeoffs

Tradeoffs

Technical: area, I/O pins, performance Technical: area, I/O pins, performance

Economic: design time, yield, time to revenue Economic: design time, yield, time to revenue

(4)

4 40 0 660 0 880 0 11000 0 112200 40 40 60 60 80 80 mm mm

Testable Sequential Circuits

Testable Sequential Circuits

Sequential circuits are very difficult to test Sequential circuits are very difficult to test

Design the internal memory Design the internal memory elements to be part of a elements to be part of a shifter register chain to shifter register chain to provide controllability, provide controllability,

observability through serial observability through serial shifts

shifts

With

With scan chainscan chain, problem of testing any circuit reduces to testing, problem of testing any circuit reduces to testing the combinational logic

(5)

4 40 0 660 0 880 0 11000 0 112200 40 40 60 60 80 80 mm mm

Level-Sensitive Scan Design (LSSD)

Level-Sensitive Scan Design (LSSD)

Structured DFT developed at IBM Structured DFT developed at IBM

All internal storage implemented in

All internal storage implemented in hazard-free polarity-holdhazard-free polarity-hold

latches (SRLs)

latches (SRLs), part of a, part of a scan chainscan chain

Latches controlled by

Latches controlled by two or more non-overlapping clockstwo or more non-overlapping clocks,, with rules for clocking

with rules for clocking

All clock inputs to SRLs must be in their off states when All clock inputs to SRLs must be in their off states when primary inputs (PIs) are “off”

primary inputs (PIs) are “off”

Clock signal at any clock input to an SRL must be controlled Clock signal at any clock input to an SRL must be controlled from one or more clock PIs

from one or more clock PIs

No clock can be ANDed with another clock No clock can be ANDed with another clock

Clock PIs cannot feed data inputs to latches, either directly or Clock PIs cannot feed data inputs to latches, either directly or through combinational logic

(6)

4 40 0 660 0 880 0 11000 0 112200 40 40 60 60 80 80 mm mm

Scan Chains

Scan Chains

Convert each flip-flop to a scan register Convert each flip-flop to a scan register

Only costs one extra multiplexer Only costs one extra multiplexer

Normal mode: flip-flops behave as usual Normal mode: flip-flops behave as usual

Scan mode: flip-flops behave as shift register Scan mode: flip-flops behave as shift register

Contents of flops can be scanned out and new values scanned Contents of flops can be scanned out and new values scanned in

(7)

40 60 80 100 120 40 60 80 mm

Scannable Flip-Flops

(8)

40 60 80 100 120 40 60 80 mm

Boundary Scan

Testing boards is also difficult

Need to verify solder joints are good Drive a pin to 0, then to 1

Check that all connected pins get the values

Single-sided PC boards with “through-hole” construction used “bed of nails” to contact pins of chip on the back side of the board

SMT and BGA boards cannot easily contact pins

Build capability of observing and controlling pins into each chip to make board test easier

(9)

40 60 80 100 120

40

60

80 mm

(10)

40 60 80 100 120

40

60

80 mm

(11)

40 60 80 100 120

40

60

80 mm

Boundary Scan Interface

Boundary scan is accessed through five pins

TCK: test clock

TMS: test mode select TDI: test data in

TDO: test data out

TRST*: test reset (optional)

Chips with internal scan chains can access the chains through boundary scan for unified test strategy

(12)

40 60 80 100 120 40 60 80 mm I DDQ

Testing

Measure quiescent power supply current of CMOS circuits for selected test vectors

Example, microprocessor has nanoamps of current with no faults; many defects (shorts, for example) cause much higher currents

Direct relationship found (Sandia Labs., HP, Phillips) between IDDQ test acceptance rates and quality, reliability of ICs

Only need to activate site of potential defect (no need to propagate errors)

Leakage current in very large chips may overwhelm the abnormal current due to a fault in one gate

(13)

40 60 80 100 120

40

60

80 mm

(14)

40 60 80 100 120

40

60

80 mm

Built-In Self Test (BIST)

Increasing circuit complexity, tester cost

Interest in techniques which integrate some tester capabilities on the chip

Reduce tester costs

Test circuits at speed (more thoroughly)

Approach:

Compress test responses into “signature”

Pseudo-random (or pseudo-exhaustive) pattern generator (PRG) on the chip

Integrating pattern generation and response evaluation on chip – BIST

(15)

40 60 80 100 120

40

60

80 mm

Pseudo-Random Sequence Generator (PRSG)

Linear Feedback Shift Register

Shift register with input taken from XOR of state

Pseudo-Random Sequence Generator (or Pseudo-Random Pattern Generator (PRPG), or Linear Feedback Shift Register (LFSR)) Step Q 0 111 1 110 2 101 3 010 4 100 5 001 6 011 7 111 (repeats)

(16)

40 60 80 100 120 40 60 80 mm

Example of BIST

Technique called

(17)

40 60 80 100 120

40

60

80 mm

Testability Issues – Custom Microprocessors

Chip complexity, large volumes, costs Area/performance penalties

Fault (defect) coverage goals

Test generation (fault simulation) costs Development time

Tester time (manufacturing costs)

ASICs generally use a “full scan” methodology for testability

Some of the ad-hoc  techniques used in custom microprocessors will

(18)

40 60 80 100 120

40

60

80 mm

Testability Features of the Motorola 68020

Chip Statistics:

Process: 2µ HCMOS

Die Size: 375 × 347 mils Transistor count: 200K

Test Overhead

Test Logic Area: < 3% Test Microcode: < 2%

Test routing shared with functional routing to keep chip area down

Bus structure helped create test partitioning for embedded structures

Tradeoff of extra time to produce ad-hoc test logic with expected higher yield

Test techniques for ROMs, PLAs, Cache

(19)

40 60 80 100 120

40

60

80 mm

Testing Scheme for ROMs in the 68000 Family

Initial schemes for testing 68000 family control structures involved bussing nanoROM outputs off-chip via the address bus in test mode

Could not test at speed since nanoROM output buffers were not sized to drive a bus

Only nanobits were observable, and the PLAs, microROM and nanoROM had to be tested as one large sequential machine Test sequences were lengthy and difficult to write

(20)

40 60 80 100 120

40

60

80 mm

Testability Techniques for 68020 ROMs

Used test mode to force next microcode address (NMA) from data pins

Data pins also control a MUX for both micro and nano ROM outputs, which are moved to the BC bus, into the data

section of the execution unit, and to the address bus which can be observed

Exhaustive testing of the 2K ROM entries

32 bits of ROM visible every 2 clocks

Four passes of tests needed to read the 110 outputs of the two ROMs

(21)

40 60 80 100 120

40

60

80 mm

Microrom Test in MC68881 Floating Point Co-processor

ROM physically split into two sections

In test mode, ROM is addressed directly through the command register

Exhaustive addresses fed to ROM in a “ping-pong” fashion (address/address complement)

Outputs of ROM go to two 16-bit signature registers (using CCITT-16 polynomial x15 + x12 + x5 + 1)

Monitor both the quotient and final signature serially on a test pin (Probability of aliasing 2−(n+m−1))

(22)

40 60 80 100 120 40 60 80 mm

MC68881 NanoROM Test

NanoROM physically located between the microROM and the execution unit (ECU), and outputs fed to ECU

No functional path for nanoROM to access signature registers

In test mode, nanoROM columns coupled with the microROM columns (with additional columns of nanoROM multiplexed to signature register)

(23)

40 60 80 100 120

40

60

80 mm

MC68881 Entry PLA Test

Four entry PLAs, A0, A1, A2 and A3

A0 PLA contains the entry point reset vectors and is completely tested functionally

A1–A3 tested like the ROMs

Command register generates patterns and outputs are routed through a bus to the signature register

Test patterns generated using PLA test generator

PLA Inputs Outputs Products

A1 6 10 23

A2 13 10 133

A3 5 10 28

(24)

40 60 80 100 120

40

60

80 mm

Built-in Self Test in the Intel 80386

Normal PLA inputs disabled during test and output of 

pseudorandom generator provides exhaustive set of tests to AND-plane input

CROM tested with binary counter (exhaustive test)

Responses compressed using multiple-input signature registers

Test transistors: 2.7% Area overhead for BIST: 1.8%

Transistor sites tested with BIST: 52.5%

(25)

40 60 80 100 120

40

60

80 mm

Testing Cache Memory Arrays in MC68030

Cache cell layout design is resistant to both bridging defects and capacitive coupling

Most likely bridging defect is between adjacent metal bit lines

Memory fault model:

One or more cells stuck at 0 or 1 Coupling between cells

11n March Test (Marinescu, 1982)

(26)

40 60 80 100 120 40 60 80 mm

Test Architecture of MC68040

Utilizes three different DFT methods

Functional tests for data paths with normal mode instructions Ad-hoc test modes constructed for testing on-chip memory arrays and on-chip phase-locked loop

Extension of existing functional snoop logic to allow arrays to be accessed from the bus similar to a static RAM array

Structured custom scan approach for ROM, PLA and control areas

Global test bus driven from two 5-bit registers coordinates test logic

IEEE 1149.1 boundary scan test interface

Boundary scan data register shared with scan logic

Test overhead:

18,560 devices, 1.55% of total number of devices

(27)

40 60 80 100 120 40 60 80 mm

MC68040 DFT Method Coverage

(28)

40 60 80 100 120

40

60

80 mm

(29)

40 60 80 100 120

40

60

80 mm

BIST in IBM Risc System/6000

LSSD with pseudo-random BIST

Common Engineer-ing Processor (COP), bus, on-card sequencer (OCS), engineering

(30)

40 60 80 100 120

40

60

80 mm

Common on-chip Processor (COP) in IBM RS/6000

Hardware for pseudo-random vector generation and result compression (31-bit LFSR)

Small processor controls operation

(31)

40 60 80 100 120

40

60

80 mm

BIST in IBM/Motorola Power-PC

Variety of test techniques applied to the Power-PC 603

Full LSSD test of logic

BIST of “large” embedded RAMS Functional test of small RAMS

I DDQ tests

BIST for cache and tag RAMs

Functional vectors (good for data cache, not instruction cache) and random BIST (size, complexity, test coverage) not

applicable

Use modified march test of Dekker (1988) log2 n pattern for data

Overhead: BIST is 2.9% of RAM array, 0.58% of total chip Performance impact: less than 100 pS due to extra MUX input leg

(32)

40 60 80 100 120

40

60

80 mm

Testing Embedded Cores in a System on Chip

Source/Sink for test: Embedded processor Test access mechanisms: Busses

Core test “wrapper” for Test Access Interface (TAI)

Lagged Fibonnaci Sequence random pattern generator

X n = (X n24 + X n55) mod m, n ≥ 55 Primitive polynomial: X 55 + X 24 + 1 Period: 2321 × (255 − 1) No multiplication necessary

x86 code for generator:

LDR R5,[R0],#4 LDR R6,[R1],#4 ADD R5,R5,R6 STR R5,[R2],#4 AND R0,R0,R3 AND R1,R1,R3 AND R2 R2 R3

(33)

40 60 80 100 120

40

60

80 mm

Issues with Built-In Self Test

Technique can run test sequences at operating frequencies and capture results within the chip

Pseudorandom pattern generators, signature analyzers Can also use weighted random

patterns or deterministic patterns Example (Synopsys): (Deterministic Logic BIST

Problems:

Hardware overhead Test power

(34)

40 60 80 100 120

40

60

80 mm

Industry Issues with Processor Testing

Concern with detecting real defects

Small delay defects due to process variations, power droops and capacitive coupling

Cause a shift in the speed of the part

Problems with logic BIST (same issues with scan AC tests) Overheads on chip

False paths tested

Test operating conditions different from normal operating modes

(35)

40 60 80 100 120

40

60

80 mm

Software-Based (Native-Mode) Self Test for Processors

Why not use functional capabilities of processors to replace BIST hardware?

No additional hardware

Reduce test costs by using low-cost testers

Increase coverage of delay defects and increase yield by testing native

No issues with excessive power consumption during test

Developed at University of Texas (Int’l Test Conference 1998)

(36)

40 60 80 100 120

40

60

80 mm

Native-Mode Signature Compression

Pseudo-code for software signature analyzer

// S: general register or memory, holds signature for each register Ri to be compressed

{

Shift Right Through Carry(Ri);

if (Carry) {S = XOR(S, feedback polynomial)}; S = XOR(S, Ri);

(37)

40 60 80 100 120

40

60

80 mm

Intel Functional BIST

Functional Random Instruction Testing at Speed (FRITS) applied to Itanium processor family and Pentium 4 line (ITC 2002)

Tests (kernels) are instruction sequences

Kernels loaded into cache and executed in real time during test application

They generate and execute pseudo-random or directed sequences of machine code

On Pentium-4, FRITS

added 5% unique coverage to manual tests

screened 10% – 15% of chips which passed wafer sort/package tests, but failed system tests

enabled low-cost testers: 40% increase in defect screening on structural tester

(38)

40 60 80 100 120

40

60

80 mm

Are Random Tests Sufficient?

Intel implementation involved code in the cache which generated random instruction sequences

Interest in generating instructions targeting faults

Possible to generate instruction sequences which will test for an internal stuck-at fault in a module

In order to deal with defects in DSM technologies, need to target small delay defects

Recent work: automatically generate instruction sequences which will target small delay defects in an internal module

(39)

40 60 80 100 120

40

60

80 mm

On-Chip Logic Structures for Debug

Distributed reconfigurable infrastructure fabric

User-guided insertion compatible with existing design flows Soft IP inserted at RTL

Configured post-silicon for maximum debug flexibility Activated dynamically

triggers to start/stop recording of signals assertion checkers

event detectors and counters pattern generators

Configuration and instrument control via JTAG TAP

(40)

40 60 80 100 120

40

60

80 mm

Instrumented Chip Example

(41)

40 60 80 100 120

40

60

80 mm

Logic Analysis and Signal Capture

DEMON block

implements triggers Signals captured by Tracer

Software reads trace buffer and prepares VCD or FSDB files

(42)

40 60 80 100 120

40

60

80 mm

Challenges in Detecting Small Delays

Need to test long paths at speed

External tester speeds do not match DUT speeds Tester skew, pad insertion delay

High-speed testers very expensive Can attempt AC-scan

Test clock frequency is determined by delay between launch and capture

Generate fast launch and capture signals on-chip Tester used only for scan in/out data

(43)

40 60 80 100 120

40

60

80 mm

Detecting Delay Defects After Manufacturing

Necessary to detect “small-delay defects”

Delay defects which don’t affect performance soon after manufacture could be reliability hazards

Need Path Delay Tests to ensure that defective parts are screened out

Difficult to apply two-pattern tests in scan mode

Requires precise control of clocks for high-speed circuits If capture clocks are based on the system clock, there is no information on the slack for the path

Solution: On-chip programmable capture mechanism

(44)

40 60 80 100 120

40

60

80 mm

Delay Lines for On-Chip Programmable Capture

(45)

40 60 80 100 120

40

60

80 mm

System for On-Chip Programmable Capture

(46)

40 60 80 100 120

40

60

80 mm

Power-Supply Noise Detection

Reference Voltage constant voltage Volta e Phase Detector  Detector  two signals se arated b a result  phase difference

(47)

40 60 80 100 120 40 60 80 mm

Voltage Detector

reset VDDclean RDL_out switch VDL_out Reference Delay Line

Varied Delay Line

VDDlocal RDL_outb VDL_outb INVregular  INVtail switch RDL_out VDL_out TRDL TVDL - TRDL

(48)

40 60 80 100 120 40 60 80 mm

Phase Detector

out<0> out<1> out<2> out<m> switch VDL_out RDL_out RDL_outb VDL_outb slow fast RDL_pulse VDL_pulse capture r 0 v0 r 1 v1 r 2 v2 r m v m INVregular  INVtail  pulse generator 

(49)

40 60 80 100 120

40

60

80 mm

Layout of Delay Lines

Interleaving RDL and VDL reduces the impact of systematic intra-die process variation

RDL_part1 VDL_part2 RDL_part3 VDL_part4

VDL_part1 RDL_part2 VDL_part3 RDL_part4

(50)

40 60 80 100 120

40

60

80 mm

Die Photo of the SCOUT Detector

Reference voltage generator 500um Voltage and phase detectors Performance monitor and test circuitry

References

Related documents

 Utilizar el cable serial propietario para este equipo.. Al entrar

 Pfd imkytad aledp nk Rlenyplk `mdy kmp flwd alk~ olk`d` pdly &amp;Plrlc lk` Tkncdwd  Pfd imkytad aledp nk Rlenyplk `mdy kmp flwd alk~ olk`d` pdly &amp;Plrlc

In conclusion, the purpose of the current review was to highlight the social and psychological elements attributed to AN by sufferers (Kelley, 1973; Nordbo et al., 2006; Woolrich

[r]

Long- term survival results of surgery alone versus sur- gery plus 5-fluorouracil and leucovorin for stage II and stage III colon cancer: Pooled analysis of NSABP C-01 through

In a nutshell, putting aside the semiconductor business, in x86 architecture both Intel and AMD are doing well and there are some ups and downs in their respective businesses but

The most basic algorithm for the problem for graphs is based on repeated SCC (strongly connected component) computation, and informally can be described as follows: for a given SCC,

As regards common clinical areas in which NSIs and sharps injuries to occur, findings of the present study revealed that about 23% students at health professional colleges