2 branch instructions and completion, Figure 6-7, Branch taken – IBM POWERPC 750GL User Manual

Page 227: Figure 6-8, Removal of fall-through branch instruction

Advertising
background image

User’s Manual

IBM PowerPC 750GX and 750GL RISC Microprocessor

gx_06.fm.(1.2)
March 27, 2006

Instruction Timing

Page 227 of 377

Figure 6-8 shows the removal of fall-through branch instructions, which occurs when a branch is not taken or
is predicted as not taken.

When a branch instruction is detected before it reaches a dispatch position, and if the branch is correctly
predicted as taken, folding the branch instruction (and any instructions from the incorrect path) reduces the
latency required for flow control to zero. Instruction execution proceeds as though the branch was never
there.

The advantage of removing the fall-through branch instructions at dispatch is only marginally less than that of
branch folding. Because the branch is not taken, only the branch instruction needs to be discarded. The only
cost of expelling the branch instruction from one of the dispatch entries rather than folding it is missing a
chance to dispatch an executable instruction from that position.

6.4.1.2 Branch Instructions and Completion

As described in the previous section, instructions that do not update either the LR or CTR are removed from
the instruction stream before they reach the completion queue, either for branch taken or by removing fall-
through branch instructions at dispatch. However, branch instructions that update the architected LR and
CTR must do so in program order. Therefore, they must perform write-back in the completion stage, like the
instructions that update the FPRs and GPRs.

Branch instructions that update the CTR or LR pass through the instruction queue like nonbranch instruc-
tions. At the point of dispatch, however, they are not sent to an execution unit, but rather are assigned a slot
in the completion queue, as shown in Figure 6-9 on page 228.

Figure 6-7. Branch Taken

Figure 6-8. Removal of Fall-Through Branch Instruction

IQ5

add5

IQ4

add4

IQ3

add3

IQ2

b

IQ1

add2

IQ0

add1

and2

and1

and6

and5

and4

and3

Branch Folding

(Taken Branch/BTIC Hit)

IQ5

add5

IQ4

add4

IQ3

add3

IQ2

b

IQ1

add2

IQ0

add1

and4

and3

and2

and1

Branch Folding

(Taken Branch/BTIC Miss)

Clock 0

Clock 1

Clock 2

Clock 0

Clock 1

Clock 2

IQ5

add5

IQ4

add4

IQ3

add3

IQ2

b

IQ1

add2

IQ0

add1

add8

add7

add6

add5

add4

add3

etc.

add9

add8

add7

add6

add5

Branch Fall-Through

(Not-Taken Branch)

Clock 0

Clock 1

Clock 2

Advertising
This manual is related to the following products: