I’d like to calculate a non-uniformly distributed random number in the range [0, n - 1]. So the min possible value is zero. The maximum possible value is n-1. I’d like the min-value to occur the most often and the max to occur relatively infrequently with an approximately linear curve between (Gaussian is fine too). How can I do this in Objective-C? (possibly using C-based APIs)
A very rough sketch of my current idea is:
// min value w/ p = 0.7
// some intermediate value w/ p = 0.2
// max value w/ p = 0.1
NSUInteger r = arc4random_uniform(10);
if (r <= 6)
result = 0;
else if (r <= 8)
result = (n - 1) / 2;
else
result = n - 1;
I think you’re on basically the right track. There are possible precision or range issues but in general if you wanted to randomly pick, say, 3, 2, 1 or 0 and you wanted the probability of picking 3 to be four times as large as the probability of picking 0 then if it were a paper exercise you might right down a grid filled with:
Toss something onto it and read the number it lands on.
The number of options there are for your desired linear scale is:
It’s a simple sum of an arithmetic progression. You end up with n(n+1)/2 possible outcomes. E.g. for n = 1 that’s 1 * 2 / 2 = 1. For n = 2 that’s 2 * 3 /2 = 3. For n = 3 that’s 3 * 4 / 2 = 6.
So you would immediately write something like:
At that point you just have to decide which bin uniformRandom falls into. The simplest way is with the most obvious loop:
Given that you can work out optionsToDate without iterating, an immediately obvious faster solution is a binary search.
An even smarter way to look at it is that uniformRandom is the sum of the boxes underneath a line from (0, 0) to (n, n). So it’s the area underneath the graph, and the graph is a simple right-angled triangle. So you can work backwards from the area formula.
Specifically, the area underneath the graph from (0, 0) to (n, n) at position x is (x*x)/2. So you’re looking for x, where:
In that case you want to take x-1 as the result hadn’t progressed to the next discrete column of the number grid. So you can get there with a square root operation simple integer truncation.
So, assuming I haven’t muddled my exact inequalities along the way, and assuming all precisions fit: