Figure 6-5, Optimizing cache usage – Intel ARCHITECTURE IA-32 User Manual

Page 321

Advertising
background image

Optimizing Cache Usage

6

6-31

Figure 6-5

Memory Access Latency and Execution With Prefetch

2 Load streams, 1 store stream

50

100

150

200

250

300

350

54 108 144 192 240 336 390

Computa tions pe r loop

E

ff

e

c

ti

v

e

l

oop

l

a

te

nc

y

0.00%

10.00%

20.00%

30.00%

40.00%

50.00%

60.00%

70.00%

80.00%

90.00%

100.00%

%

o

f B

u

s

U

tili

ze

d

16

32

64

128

none

% Bus Utilization

One load and one store stream

0

50

100

150

200

250

300

350

48 108 144 192 240 336 408

Computations per loop

Ef

fe

c

ti

v

e

l

oop

la

te

nc

y

0.00%

10.00%

20.00%

30.00%

40.00%

50.00%

60.00%

70.00%

80.00%

90.00%

100.00%

%

o

f B

u

s

U

tiliz

a

tio

n

16_por

32_por

64_por

128_por

None_por

% Bus Utilization

Advertising