I’m using the cut function to split my data in equal bins, it does the job but I’m not happy with the way it returns the values. What I need is the center of the bin not the upper and lower ends.
I’ve also tried to use cut2{Hmisc}, this gives me the center of each bins, but it divides the range of data in bins that contains the same numbers of observations, rather than being of the same length.
Does anyone have a solution to this?
It’s not too hard to make the breaks and labels yourself, with something like this. Here since the midpoint is a single number, I don’t actually return a factor with labels but instead a numeric vector.
There’s probably a better way to get the bin breaks and midpoints; I didn’t think about it very hard.
Note that this answer is different than Joshua’s; his gives the median of the data in each bins while this gives the center of each bin.