I’m wondering if anyone as any suggestions for this problem.
I’m using intersect and except (Linq) with a custom IEqualityComparer in order to query the set differences and set intersections of two sequences of ISyncableUsers.
public interface ISyncableUser
{
string Guid { get; }
string UserPrincipalName { get; }
}
The logic behind whether two ISyncableUsers are equal is conditional. The conditions center around whether either of the two properties, Guid and UserPrincipalName, have values. The best way to explain this logic is with code. Below is my implementation of the Equals method of my customer IEqualityComparer.
public bool Equals(ISyncableUser userA, ISyncableUser userB)
{
if (userA == null && userB == null)
{
return true;
}
if (userA == null)
{
return false;
}
if (userB == null)
{
return false;
}
if ((!string.IsNullOrWhiteSpace(userA.Guid) && !string.IsNullOrWhiteSpace(userB.Guid)) &&
userA.Guid == userB.Guid)
{
return true;
}
if (UsersHaveUpn(userA, userB))
{
if (userB.UserPrincipalName.Equals(userA.UserPrincipalName, StringComparison.InvariantCultureIgnoreCase))
{
return true;
}
}
return false;
}
private bool UsersHaveUpn(ISyncableUser userA, ISyncableUser userB)
{
return !string.IsNullOrWhiteSpace(userA.UserPrincipalName)
&& !string.IsNullOrWhiteSpace(userB.UserPrincipalName);
}
The problem I’m having, is with implementing GetHashCode so that the above conditional equality, represented above, is respected. The only way I’ve been able to get the intersect and except calls to work as expected is to simple always return the same value from GetHashCode(), forcing a call to Equals.
public int GetHashCode(ISyncableUser obj)
{
return 0;
}
This works but the performance penalty is huge, as expected. (I’ve tested this with non-conditional equality. With two sets containing 50000 objects, a proper hashcode implementation allows execution of intercept and except in about 40ms. A hashcode implementation that always returns 0 takes approximately 144000ms (yes, 2.4 minutes!))
So, how would I go about implementing a GetHashCode() in the scenario above?
Any thoughts would be more than welcome!
If I’m reading this correctly, your equality relation is not transitive. Picture the following three
ISyncableUsers:A == Bbecause they have the sameUserPrincipalNameB == Cbecause they have the sameGuidA != Cbecause they don’t share either.From the spec,
If your equality relation isn’t consistent, there’s no way you can implement a hash code that backs it up.
From another point of view: you’re essentially looking for three functions:
Gmapping GUIDs to ints (if you know the GUID but the UPN is blank)Umapping UPNs to ints (if you know the UPN but the GUID is blank)Pmapping (guid, upn) pairs to ints (if you know both)such that
G(g) == U(u) == P(g, u)for allgandu. This is only possible if you ignoreganducompletely.