Simplify and speed up multiprocessing
This is a bit of a WIP, I moved away from Queue (since it seems to be the bottleneck)
From #265 (closed) the same test finishes (still slower) but in reasonable time:
$ time flake8 -j8 bar
real 0m17.583s
user 0m26.312s
sys 0m2.288s