Consider the following code:
#include <iostream>
struct foo
{
// (a):
void bar() { std::cout << "gman was here" << std::endl; }
// (b):
void baz() { x = 5; }
int x;
};
int main()
{
foo* f = 0;
f->bar(); // (a)
f->baz(); // (b)
}
We expect (b) to crash, because there is no corresponding member x for the null pointer. In practice, (a) doesn’t crash because the this pointer is never used.
Because (b) dereferences the this pointer ((*this).x = 5;), and this is null, the program enters undefined behavior, as dereferencing null is always said to be undefined behavior.
Does (a) result in undefined behavior? What about if both functions (and x) are static?
Both
(a)and(b)result in undefined behavior. It’s always undefined behavior to call a member function through a null pointer. If the function is static, it’s technically undefined as well, but there’s some dispute.The first thing to understand is why it’s undefined behavior to dereference a null pointer. In C++03, there’s actually a bit of ambiguity here.
Although “dereferencing a null pointer results in undefined behavior” is mentioned in notes in both §1.9/4 and §8.3.2/4, it’s never explicitly stated. (Notes are non-normative.)
However, one can try to deduced it from §3.10/2:
When dereferencing, the result is an lvalue. A null pointer does not refer to an object, therefore when we use the lvalue we have undefined behavior. The problem is that the previous sentence is never stated, so what does it mean to “use” the lvalue? Just even generate it at all, or to use it in the more formal sense of perform lvalue-to-rvalue conversion?
Regardless, it definitely cannot be converted to an rvalue (§4.1/1):
Here it’s definitely undefined behavior.
The ambiguity comes from whether or not it’s undefined behavior to deference but not use the value from an invalid pointer (that is, get an lvalue but not convert it to an rvalue). If not, then
int *i = 0; *i; &(*i);is well-defined. This is an active issue.So we have a strict “dereference a null pointer, get undefined behavior” view and a weak “use a dereferenced null pointer, get undefined behavior” view.
Now we consider the question.
Yes,
(a)results in undefined behavior. In fact, ifthisis null then regardless of the contents of the function the result is undefined.This follows from §5.2.5/3:
*(E1)will result in undefined behavior with a strict interpretation, and.E2converts it to an rvalue, making it undefined behavior for the weak interpretation.It also follows that it’s undefined behavior directly from (§9.3.1/1):
With static functions, the strict versus weak interpretation makes the difference. Strictly speaking, it is undefined:
That is, it’s evaluated just as if it were non-static and we once again dereference a null pointer with
(*(E1)).E2.However, because
E1is not used in a static member-function call, if we use the weak interpretation the call is well-defined.*(E1)results in an lvalue, the static function is resolved,*(E1)is discarded, and the function is called. There is no lvalue-to-rvalue conversion, so there’s no undefined behavior.In C++0x, as of n3126, the ambiguity remains. For now, be safe: use the strict interpretation.