CS 3853 Architecture Notes on Appendix C Section 2

Read Appendix C

C2: Pipeline Hazards

A hazard prevents the next instruction from executing during its designated clock cycle.

Hazard Classifications

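structural hazard: the hardware cannot support every combination of instructions in overlapped execution, so two instructions need the same resource in the same clock cycle.
data hazard: an instruction needs a result from a previous instruction before that result is available.
control hazard: a branch changes the PC after one or more later instructions have already been fetched.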

A hazard may require that the pipeline stalls until the hazard can be cleared.
For now, when a stall occurs:

all instructions issued later will also stall
all instructions issued earlier will continue so that the hazard can be cleared

Performance with stalls

We will compare an unpipelined machine in which instructions take several cycles to a pipelined machine with the same clock rate.

$$\text{speedup} = \frac{\text{Average instruction time unpipelined}}{\text{Average instruction time pipelined}} = \frac{\text{CPI unpipelined}}{\text{CPI pipelined}}$$

with no hazards, CPI pipelined = 1.
with hazards, CPI pipelined = 1 + stall cycles per instruction, so

$$\text{speedup} = \frac{\text{CPI unpipelined}}{1 + \text{stall cycles per instruction}}$$

in the case in which all instructions on the unpipelined machine take the same time and the pipeline is completely balanced with no overhead, CPI unpipelined = pipeline depth and

$$\text{speedup} = \frac{\text{pipeline depth}}{1 + \text{stall cycles per instruction}}$$
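
As a quick numeric check of the last formula, here is a minimal Python sketch. The function name `pipeline_speedup` and the sample stall rates are illustrative assumptions, not values from the textbook.

```python
def pipeline_speedup(pipeline_depth, stall_cycles_per_instruction):
    """Speedup over the unpipelined machine for a perfectly balanced pipeline.

    Assumes, as in the notes, the same clock rate on both machines, no
    pipelining overhead, and CPI unpipelined equal to the pipeline depth.
    """
    if pipeline_depth < 1:
        raise ValueError("pipeline depth must be at least 1")
    if stall_cycles_per_instruction < 0:
        raise ValueError("stall cycles per instruction cannot be negative")
    cpi_pipelined = 1 + stall_cycles_per_instruction
    return pipeline_depth / cpi_pipelined


if __name__ == "__main__":
    # Hypothetical stall rates for a 5-stage pipeline.
    for stalls in (0.0, 0.25, 1.0):
        print(f"stalls/instr = {stalls}: speedup = {pipeline_speedup(5, stalls)}")
```

With no stalls the speedup equals the pipeline depth; every additional stall cycle per instruction increases the denominator and reduces the speedup.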

Structural Hazards

At some stage of the pipeline, two instructions require the same resource.
Example: A shared single-memory port for data and instructions

instruction memory is always used in the first stage of the pipeline for the instruction fetch
a load (or store) instruction will access the data memory in the 4th stage (MEM)
with a single memory port shared by data and instructions, an instruction fetch and a data memory access cannot happen in the same clock cycle.
Figure C.4 shows a load instruction followed by 4 non-memory instructions.

Here is a timing diagram showing the stall (like figure C.5):
This assumes none of the other instructions are loads or stores so they do not need to access memory in the MEM stage.

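	clock number
Instruction	1	2	3	4	5	6	7	8	9	10	11	12
Load instruction	IF	ID	EX	MEM	WB
Instruction i+1		IF	ID	EX	MEM	WB
Instruction i+2			IF	ID	EX	MEM	WB
Instruction i+3				stall	IF	ID	EX	MEM	WB
Instruction i+4						IF	ID	EX	MEM	WB
Instruction i+5							IF	ID	EX	MEM	WB
Instruction i+6								IF	ID	EX	MEM	WB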
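
The stall in this diagram can be reproduced with a small simulation. The following is a sketch only, assuming the five stages IF, ID, EX, MEM, WB, a single memory port, and the stall rule from above (later instructions wait, earlier ones continue). The names `fetch_cycles` and `timing_rows` are made up for illustration.

```python
STAGES = ["IF", "ID", "EX", "MEM", "WB"]


def fetch_cycles(uses_data_memory):
    """Return the clock cycle (1-based) in which each instruction is fetched.

    uses_data_memory[i] is True when instruction i is a load or store.
    A fetch is delayed while an earlier load/store occupies the single
    memory port in its MEM stage.
    """
    result = []
    port_busy = set()  # cycles in which a MEM stage uses the memory port
    cycle = 1
    for is_memory_op in uses_data_memory:
        while cycle in port_busy:  # structural hazard: stall the fetch
            cycle += 1
        result.append(cycle)
        if is_memory_op:
            port_busy.add(cycle + 3)  # MEM comes three cycles after IF
        cycle += 1
    return result


def timing_rows(uses_data_memory):
    """Return one dict per instruction mapping clock cycle -> stage or 'stall'."""
    rows = []
    previous_fetch = 0
    for start in fetch_cycles(uses_data_memory):
        # Cycles between the expected and the actual fetch are stalls.
        row = {c: "stall" for c in range(previous_fetch + 1, start)}
        row.update({start + k: stage for k, stage in enumerate(STAGES)})
        rows.append(row)
        previous_fetch = start
    return rows


if __name__ == "__main__":
    program = [True] + [False] * 6  # one load, then six non-memory instructions
    rows = timing_rows(program)
    last_cycle = max(max(row) for row in rows)
    print("\t".join(["Instruction"] + [str(c) for c in range(1, last_cycle + 1)]))
    for i, row in enumerate(rows):
        cells = [row.get(c, "") for c in range(1, last_cycle + 1)]
        print("\t".join([f"i+{i}"] + cells))
```

For the load followed by six non-memory instructions, the fetch cycles come out as 1, 2, 3, 5, 6, 7, 8: instruction i+3 loses clock 4 to the load's MEM stage.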

Today's News: September 9, 2015

No news

	clock number
Instruction	1	2	3	4	5	6	7	8	9
`LD R1,0(R2)`	IF	ID	EX	MEM	WB
`DSUB R4,R1,R5`		IF	ID	stall	EX	MEM	WB
`AND R6,R1,R7`			IF	stall	ID	EX	MEM	WB
`OR R8,R1,R9`				stall	IF	ID	EX	MEM	WB
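
In the table above, `DSUB` needs R1 in the cycle right after the load, so it stalls for one cycle and the instructions behind it are delayed with it. A minimal sketch of that check follows, assuming forwarding resolves every other dependence so that only a load immediately followed by a consumer of its result stalls. The `Instr` tuple and `load_use_stalls` are hypothetical helpers, not part of any real tool.

```python
from typing import NamedTuple, Optional, Tuple


class Instr(NamedTuple):
    op: str                   # mnemonic, e.g. "LD" or "DSUB"
    dest: Optional[str]       # destination register, or None
    sources: Tuple[str, ...]  # registers read by the instruction


def load_use_stalls(program):
    """Count one-cycle stalls from a load followed directly by a user of its result.

    Assumption: forwarding covers all other data hazards, so only
    adjacent load/consumer pairs cost a cycle.
    """
    stalls = 0
    for previous, current in zip(program, program[1:]):
        if previous.op == "LD" and previous.dest in current.sources:
            stalls += 1
    return stalls


if __name__ == "__main__":
    program = [
        Instr("LD", "R1", ("R2",)),
        Instr("DSUB", "R4", ("R1", "R5")),
        Instr("AND", "R6", ("R1", "R7")),
        Instr("OR", "R8", ("R1", "R9")),
    ]
    print("load-use stalls:", load_use_stalls(program))
```

`AND` and `OR` also read R1, but by the time they reach EX the loaded value is already available, so only the `DSUB` pair counts.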

Branch instruction	IF	ID	EX	MEM	WB
Branch successor		IF	IF	ID	EX	MEM	WB
Branch successor + 1				IF	ID	EX	MEM	WB
Branch successor + 2					IF	ID	EX	MEM	WB

Untaken branch instruction	IF	ID	EX	MEM	WB
Instruction i + 1		IF	ID	EX	MEM	WB
Instruction i + 2			IF	ID	EX	MEM	WB
Instruction i + 3				IF	ID	EX	MEM	WB

Taken branch instruction	IF	ID	EX	MEM	WB
Instruction i + 1		IF	idle	idle	idle	idle
Branch target			IF	ID	EX	MEM	WB
Branch target + 1				IF	ID	EX	MEM	WB
Branch target + 2					IF	ID	EX	MEM	WB
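
The branch tables above imply two different costs: refetching the successor costs one cycle on every branch, while treating the branch as untaken costs one cycle only when the branch is taken. The sketch below turns that into a CPI estimate. The function name, the scheme labels, and the sample frequencies are assumptions chosen for illustration, and all other stalls are ignored.

```python
def cpi_with_branches(branch_fraction, taken_fraction, scheme):
    """Estimate CPI when branches are the only source of stalls.

    scheme "stall":             every branch costs one stall cycle.
    scheme "predict_not_taken": only taken branches cost one cycle.
    """
    if not 0.0 <= branch_fraction <= 1.0:
        raise ValueError("branch_fraction must be between 0 and 1")
    if not 0.0 <= taken_fraction <= 1.0:
        raise ValueError("taken_fraction must be between 0 and 1")
    if scheme == "stall":
        penalty_per_branch = 1.0
    elif scheme == "predict_not_taken":
        penalty_per_branch = taken_fraction
    else:
        raise ValueError(f"unknown scheme: {scheme}")
    return 1.0 + branch_fraction * penalty_per_branch


if __name__ == "__main__":
    # Hypothetical workload: 20% branches, half of them taken.
    for scheme in ("stall", "predict_not_taken"):
        print(scheme, cpi_with_branches(0.2, 0.5, scheme))
```

Dividing the pipeline depth by either CPI gives the corresponding speedup from the "Performance with stalls" formula.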
