I came across a #define in which they use __builtin_expect.
The documentation says:
Built-in Function:
long __builtin_expect (long exp, long c)You may use
__builtin_expectto provide the compiler with branch
prediction information. In general, you should prefer to use actual
profile feedback for this (-fprofile-arcs), as programmers are
notoriously bad at predicting how their programs actually perform.
However, there are applications in which this data is hard to collect.The return value is the value of
exp, which should be an integral
expression. The semantics of the built-in are that it is expected that
exp == c. For example:if (__builtin_expect (x, 0)) foo ();would indicate that we do not expect to call
foo, since we expectxto be zero.
So why not directly use:
if (x)
foo ();
instead of the complicated syntax with __builtin_expect?
Imagine the assembly code that would be generated from:
I guess it should be something like:
You can see that the instructions are arranged in such an order that the
barcase precedes thefoocase (as opposed to the C code). This can utilise the CPU pipeline better, since a jump thrashes the already fetched instructions.Before the jump is executed, the instructions below it (the
barcase) are pushed to the pipeline. Since thefoocase is unlikely, jumping too is unlikely, hence thrashing the pipeline is unlikely.