Some tl;dr highlights:
* The 84-byte packets are designed to simulate the latency characteristics of 100Gbit/s Ethernet on 10Gbit/s hardware
* 2 cache misses are enough to blow the time budget for processing a packet
* The overhead of 1 syscall on SELinux is itself enough to blow the time budget
* Big-picture strategy: batch packets, allocs, etc.
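
For a rough sense of the time budget these bullets refer to, here is a back-of-the-envelope sketch. It assumes the 84 bytes are a minimum-size 64-byte Ethernet frame plus the 8-byte preamble and 12-byte inter-frame gap, and that the budget is simply the time one such frame occupies a 10Gbit/s link. The fixed per-call cost used in the batching part is a hypothetical placeholder, not a measurement, and the function names are made up for illustration.

```python
def wire_time_ns(bytes_on_wire: int, link_bits_per_sec: float) -> float:
    """Time one frame occupies the link, in nanoseconds."""
    return bytes_on_wire * 8 / link_bits_per_sec * 1e9


def amortized_cost_ns(fixed_cost_ns: float, batch_size: int) -> float:
    """Per-packet share of a fixed cost that is paid once per batch."""
    return fixed_cost_ns / batch_size


# Assumption: 64-byte minimum frame + 8-byte preamble + 12-byte inter-frame gap.
MIN_FRAME_ON_WIRE = 64 + 8 + 12  # = 84 bytes
LINK_10G = 10e9  # bits per second

budget_ns = wire_time_ns(MIN_FRAME_ON_WIRE, LINK_10G)
packets_per_sec = LINK_10G / (MIN_FRAME_ON_WIRE * 8)

# Hypothetical fixed cost (e.g. one syscall), chosen only to illustrate batching.
HYPOTHETICAL_FIXED_COST_NS = 70.0

print(f"budget per packet: {budget_ns:.1f} ns")
print(f"packet rate: {packets_per_sec / 1e6:.2f} Mpps")
for batch in (1, 8, 64):
    share = amortized_cost_ns(HYPOTHETICAL_FIXED_COST_NS, batch)
    print(f"batch of {batch:>2}: {share:.2f} ns per packet, "
          f"{share / budget_ns:.0%} of the budget")
```

With a budget this small, any fixed cost in the tens of nanoseconds eats most or all of it when paid per packet, which is why spreading that cost over a batch is the recurring strategy.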