This pattern comes up a lot but I can’t find a straight answer.
An non-critical, un-friendly program might do
while(True):
# do some work
Using other technologies and platforms, if you want to allow this program to run hot (use as much CPU cycles as possible) but be polite – allow other programs who are running hot to effectively slow me down, you’d frequently write:
while(True):
#do some work
time.sleep(0)
I’ve read conflicting information about whether the latter approach would do what I’d hope on python, running on a linux box. Does it cause a context switch, resulting in the behavior I mentioned above?
EDIT: For what’s worth, we tried a little experiment in Apple OSX (didn’t have a linux box handy). This box has 4 cores plus hyperthreading so we spun up 8 programs with just a
while(True):
i += 1
As expected, the Activity Monitor shows each of the 8 processes as consuming over 95% CPU (apparently with 4 cores and hyperthreading you get 800% total). We then spun up a ninth such program. Now all 9 run around 85%. Now kill the ninth guy and spin up a program with
while(True):
i += 1
time.sleep(0)
I was hoping that this process would use close to 0% and the other 8 would run 95%. But instead, all nine run around 85%. So on Apple OSX, sleep(0) appears to have no effect.
I’d never thought about this, so I wrote this script:
Just as a test. Running this with
strace -o isacontextswitch.strace -s512 python test.pygives you this output on the loop:select()is a system call, so yes, you are context switching (ok technically a context switch is not actually necessary when you change to kernel space, but if you have other processes running, what you’re saying here is that unless you have data ready to read on your file descriptor, other processes can run until then) into the kernel in order to perform this. Interestingly, the delay is in selecting on stdin. This allows python to interrupt your input on events such asctrl+cinput, should they wish, without having to wait for the code to time out – which I think is quite neat.I should note that the same applies to
time.sleep(0)except that the time parameter passed in is{0,0}. And that spin locking is not really ideal for anything but very short delays –multiprocessingandthreadsprovide the ability to wait on event objects.Edit: So I had a look to see exactly what linux does. The implementation in
do_select(fs\select.c) makes this check:In other words, if an end time is provided and both parameters are zero (!0 = 1 and evaluates to true in C) then the wait is set to NULL and the select is considered timed out. However, that doesn’t mean the function returns back to you; it loops over all the file descriptors you have and calls
cond_resched, thereby potentially allowing another process to run. In other words, what happens is entirely up to the scheduler; if your process has been hogging CPU time compared to other processes, chances are a context switch will take place. If not, the task you are in (the kernel do_select function) might just carry on until it completes.I would re-iterate, however, that the best way to be nicer to other processes generally involves using other mechanisms than a spin lock.