Performance Tools, Intel® C++ Compiler – Intel ARCHITECTURE IA-32 User Manual

General Optimization Guidelines

Performance Tools

Intel offers several tools that can facilitate optimizing your application’s
performance.

Intel® C++ Compiler

Use the Intel C++ Compiler following the recommendations described
here. The Intel Compiler’s advanced optimization features provide good
performance without the need to hand-tune assembly code. However,
the following features may enhance performance even further:

Inlined assembly

Intrinsics, which have a one-to-one correspondence with assembly
language instructions but allow the compiler to perform register
allocation and instruction scheduling. Refer to the “Intel C++
Intrinsics Reference” section of the Intel® C++ Compiler User’s
Guide.

C++ class libraries. Refer to the “Intel C++ Class Libraries for
SIMD Operations Reference” section of the Intel® C++ Compiler
User’s Guide.

Vectorization in conjunction with compiler directives (pragmas).
Refer to the “Compiler Vectorization Support and Guidelines”
section of the Intel® C++ Compiler User’s Guide.
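
As a rough illustration of two of the features listed above, the sketch below adds two arrays of doubles twice: once as a plain loop that a vectorizing compiler may handle on its own, and once with SSE2 intrinsics. The function names (add_scalar, add_intrinsics) and the HAVE_SSE2_SKETCH macro are hypothetical, chosen only for this example. The `#pragma ivdep` hint is applied only when the Intel compiler is detected, and the intrinsic path falls back to scalar code when SSE2 is not enabled at compile time.

```cpp
#include <cassert>
#include <cstddef>

// Assumption for this sketch: detect compile-time SSE2 support through
// common predefined macros; HAVE_SSE2_SKETCH is a name made up here.
#if defined(__SSE2__) || defined(_M_X64) || (defined(_M_IX86_FP) && _M_IX86_FP >= 2)
#include <emmintrin.h>  // SSE2 intrinsics (__m128d, _mm_add_pd, ...)
#define HAVE_SSE2_SKETCH 1
#else
#define HAVE_SSE2_SKETCH 0
#endif

// Plain loop: a candidate for the compiler's automatic vectorizer.
// The caller must ensure that out does not partially overlap a or b.
void add_scalar(const double* a, const double* b, double* out, std::size_t n) {
    assert(a && b && out);
#if defined(__INTEL_COMPILER)
#pragma ivdep  // Intel-specific hint: ignore assumed loop-carried dependences
#endif
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = a[i] + b[i];
    }
}

// Intrinsics version: each call maps to one SSE2 instruction, while the
// compiler still chooses registers and schedules the instructions.
void add_intrinsics(const double* a, const double* b, double* out, std::size_t n) {
    assert(a && b && out);
    std::size_t i = 0;
#if HAVE_SSE2_SKETCH
    for (; i + 2 <= n; i += 2) {
        __m128d va = _mm_loadu_pd(a + i);            // unaligned load of 2 doubles
        __m128d vb = _mm_loadu_pd(b + i);
        _mm_storeu_pd(out + i, _mm_add_pd(va, vb));  // packed add, unaligned store
    }
#endif
    // Scalar tail (or the whole array when SSE2 is unavailable).
    for (; i < n; ++i) {
        out[i] = a[i] + b[i];
    }
}
```

Both functions should produce identical results for the same inputs. The unaligned load and store intrinsics are used so that the sketch makes no assumption about array alignment.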

The Intel C++ Compiler can generate an executable that uses features
such as Streaming SIMD Extensions 2 (SSE2). The executable will maximize
performance on the current generation of the IA-32 processor family (for
example, a Pentium 4 processor) and still execute correctly on older
processors. Refer to the “Processor Dispatch Support” section in the
Intel® C++ Compiler User’s Guide.
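
The idea behind processor dispatch can be sketched by hand: keep a generic code path and an SSE2 path in the same executable, and pick one at run time. This is only an analogy, not the mechanism the Intel compiler generates. It assumes a GCC- or Clang-compatible compiler on x86 for the `__builtin_cpu_supports` check and the `target` attribute, and the names sum_generic, sum_sse2, select_sum, sum, and X86_DISPATCH_SKETCH are hypothetical.

```cpp
#include <cstddef>

// Assumption: the run-time CPU check below relies on GCC/Clang builtins
// on x86; on any other toolchain only the generic path is compiled.
#if (defined(__GNUC__) || defined(__clang__)) && (defined(__i386__) || defined(__x86_64__))
#include <emmintrin.h>
#define X86_DISPATCH_SKETCH 1
#else
#define X86_DISPATCH_SKETCH 0
#endif

// Generic path: runs correctly on any processor.
double sum_generic(const double* x, std::size_t n) {
    double s = 0.0;
    for (std::size_t i = 0; i < n; ++i) s += x[i];
    return s;
}

#if X86_DISPATCH_SKETCH
// SSE2 path: compiled for SSE2 even if the rest of the file is not.
__attribute__((target("sse2")))
double sum_sse2(const double* x, std::size_t n) {
    __m128d acc = _mm_setzero_pd();
    std::size_t i = 0;
    for (; i + 2 <= n; i += 2) {
        acc = _mm_add_pd(acc, _mm_loadu_pd(x + i));  // two partial sums
    }
    double lanes[2];
    _mm_storeu_pd(lanes, acc);
    double s = lanes[0] + lanes[1];
    for (; i < n; ++i) s += x[i];  // odd-length tail
    return s;
}
#endif

using SumFn = double (*)(const double*, std::size_t);

// Choose the implementation once, based on what the running CPU supports.
SumFn select_sum() {
#if X86_DISPATCH_SKETCH
    if (__builtin_cpu_supports("sse2")) return sum_sse2;
#endif
    return sum_generic;
}

// Public entry point: callers never need to know which path was taken.
double sum(const double* x, std::size_t n) {
    static const SumFn fn = select_sum();
    return fn(x, n);
}
```

Callers use only sum(). Because the SSE2 path adds the elements in a different order, results for general floating-point data may differ from the generic path in the last bits.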
