I’m trying to using numpy.lib.stride_tricks.as_strided to iterate over non-overlapping blocks of an array, but I’m having trouble finding documentation of the parameters, so I’ve only been able to get overlapping blocks.
For example, I have a 4×5 array which I’d like to get 4 2×2 blocks from. I’m fine with the extra cells on the right and bottom edge being excluded.
So far, my code is:
import sys
import numpy as np
a = np.array([
[1,2,3,4,5],
[6,7,8,9,10],
[11,12,13,14,15],
[16,17,18,19,20],
])
sz = a.itemsize
h,w = a.shape
bh,bw = 2,2
shape = (h/bh, w/bw, bh, bw)
strides = (w*sz, sz, w*sz, sz)
blocks = np.lib.stride_tricks.as_strided(a, shape=shape, strides=strides)
print blocks[0][0]
assert blocks[0][0].tolist() == [[1, 2], [6,7]]
print blocks[0][1]
assert blocks[0][1].tolist() == [[3,4], [8,9]]
print blocks[1][0]
assert blocks[1][0].tolist() == [[11, 12], [16, 17]]
The shape of the resulting blocks array seems to be correct, but the last two asserts fail, presumably because my shape or strides parameters are incorrect. What values for these should I set to get non-overlapping blocks?
Starting at the
1ina(i.e.blocks[0,0,0,0]), to get to the2(i.e.blocks[0,0,0,1]) is one item away. Since (on my machine) thea.itemsizeis 4 bytes, the stride is 1*4 = 4. This gives us the last value instrides = (10,2,5,1)*a.itemsize = (40,8,20,4).Starting at the
1again, to get to the6(i.e.blocks[0,0,1,0]), is 5 (i.e.w) items away, so the stride is 5*4 = 20. This accounts for the second to last value instrides.Starting at the
1yet again, to get to the3(i.e.blocks[0,1,0,0]), is 2 (i.e.bw) items away, so the stride is 2*4 = 8. This accounts for the second value instrides.Finally, starting at the
1, to get to11(i.e.blocks[1,0,0,0]), is 10 (i.e.w*bh) items away, so the stride is 10*4 = 40. Sostrides = (40,8,20,4).