We need to double check that memory allocator (slab, slub) does not introduce lots of unnecessary synchronization. We need to ignore all synchronization in the allocator. Otherwise even fast-path will synchronize threads on per-cpu cache. Slow path will synchronize even threads running on different cpus. We should ignore that synchronization as much as possible.