We need some coherent benchmarking setup, preferrably one that's run nightly / weekly on some consistent machine.