What’s the fastest, best way on modern Linux of achieving the same effect as a fork–execve combo from a large process ?
My problem is that the process forking is ~500MByte big, and a simple benchmarking test achieves only about 50 forks/s from the process (c.f ~1600 forks/s from a minimally sized process) which is too slow for the intended application.
Some googling turns up vfork as having being invented as the solution to this problem… but also warnings about not to use it. Modern Linux seems to have acquired related clone and posix_spawn calls; are these likely to help ? What’s the modern replacement for vfork ?
I’m using 64bit Debian Lenny on an i7 (the project could move to Squeeze if posix_spawn would help).
Outcome: I was going to go down the early-spawned helper subprocess route as suggested by other answers here, but then I came across this re using huge page support to improve fork performance.
Having tried it myself using libhugetlbfs to simply make all my app’s mallocs allocate huge pages, I’m now getting around 2400 forks/s regardless of the process size (over the range I’m interested in anyway). Amazing.