COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor. Dr. Randa Mohamed

(1)

C

OMPUTER

O

RGANIZATION AND

D

ESIGN

The Hardware/Software Interface

5th

Edition

Chapter 4

The Processor

(2)

Agenda

◼

Performance Issues

◼

MIPS Pipeline:

(3)

(4)

Performance Issues

◼

Longest delay determines clock period

◼ Critical path: load instruction

◼ Instruction memory → register file → ALU →

data memory → register file

(5)

Performance Issues

◼

Not feasible to vary period for different

instructions

(6)

Pipelining Analogy

◼

Pipelined laundry: overlapping execution

◼ Parallelism improves performance

(7)

MIPS Pipeline

◼

Five stages, one step per stage

1. IF: Instruction fetch from memory

2. ID: Instruction decode & register read

3. EX: Execute operation or calculate address

4. MEM: Access memory operand

5. WB: Write result back to register

(8)

(9)

Pipeline Performance

◼ Assume time for stages is

◼ 100ps for register read or write ◼ 200ps for other stages

◼ Compare pipelined datapath with single-cycle

datapath

(10)

Pipeline Performance

Single-cycle (T_c= 800ps)

(11)

Pipeline Speedup

◼

Speedup = CPUtime

_N

CPUtime

_P

◼ CPUtime_N= IC × T_N

(12)

Pipeline Speedup

◼

Speedup = CPUtime

_N

CPUtime

_P

◼ CPUtime_N= IC × T_N _3×800=2400

(13)

Pipeline Speedup

◼

Speedup = CPUtime

_N

CPUtime

_P

◼ CPUtime_P= No. Stages × T_P+ (IC-1) × T_P

Pipelined (T = 200ps)

(14)

◼ Speedup = IC × T_N

No. Stages × T_P+ (IC-1) × T_P = IC × T_N

T_P (No. Stages + IC-1)

Pipeline Speedup

◼

Speedup = CPUtime

_N

CPUtime

_P

(15)

No. Stages × T_P+ (IC-1) × T_P = IC × T_N

T_P (No. Stages -1+ IC)

Pipeline Speedup

◼

Speedup = CPUtime

_N

CPUtime

_P

(16)

No. Stages × T_P+ (IC-1) × T_P

= IC × T_N = T_N = No. Stages × T_P Tp× IC T_P T_P

Pipeline Speedup

◼

Speedup = CPUtime

_N

CPUtime

_P ◼ CPUtime_N= IC × T_N

= No. Stages

3×800=2400 (5×200)+ (2×200)=1400

1.7

(17)

No. Stages × T_P+ (IC-1) × T_P

= IC × T_N = T_N = No. Stages × T_P Tp× IC T_P T_P

Pipeline Speedup

◼

Speedup = CPUtime

_N

CPUtime

_P ◼ CPUtime_N= IC × T_N

= No. Stages

1000×800=800000 (5×200)+ (999×200)=200800

3.9

(18)

(19)

Pipelined Datapath

◼

Need registers between stages

(20)

Pipeline Operation

◼

Cycle-by-cycle flow of instructions through the

pipelined datapath

◼ “Single-clock-cycle” pipeline diagram ◼ Shows pipeline usage in a single cycle ◼ Highlight resources used

◼ c.f. “multi-clock-cycle” diagram ◼ Graph of operation over time

(21)

(22)

(23)

(24)

(25)

WB for Load

(26)

(27)

(28)

(29)

(30)

Multi-Cycle Pipeline Diagram

(31)

Multi-Cycle Pipeline Diagram

(32)

Single-Cycle Pipeline Diagram

(33)

(34)

(35)

Pipelined Control

◼

Control signals derived from instruction

(36)

(37)

Concluding Remarks

◼ ISA influences design of datapath and control ◼ Datapath and control influence design of ISA ◼ Pipelining improves instruction throughput

using parallelism

◼ More instructions completed per second ◼ Latency for each instruction not reduced

(38)

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor. Dr. Randa Mohamed

C

O

D

Chapter 4

The Processor

Agenda

Performance Issues

MIPS Pipeline:

Performance Issues

Longest delay determines clock period

Performance Issues

Not feasible to vary period for different

instructions

Pipelining Analogy

Pipelined laundry: overlapping execution

MIPS Pipeline

Five stages, one step per stage

Pipeline Performance

Pipeline Performance

Pipeline Speedup

Speedup = CPUtime

CPUtime

Pipeline Speedup

Speedup = CPUtime

CPUtime

Pipeline Speedup

Speedup = CPUtime

CPUtime

Pipeline Speedup

Speedup = CPUtime

CPUtime

Pipeline Speedup

Speedup = CPUtime

CPUtime

Pipeline Speedup

Speedup = CPUtime

CPUtime

Pipeline Speedup

Speedup = CPUtime

CPUtime

Pipelined Datapath

Need registers between stages

Pipeline Operation

Cycle-by-cycle flow of instructions through the

pipelined datapath

WB for Load

Multi-Cycle Pipeline Diagram

Multi-Cycle Pipeline Diagram

Single-Cycle Pipeline Diagram

Pipelined Control

Control signals derived from instruction

Concluding Remarks

Problems to solve