|
|
|
|
|
by 05
2675 days ago
|
|
If the bottleneck is encryption speed, then you can definitely improve perf by spreading packets across the cores. Inter-core synch isn’t that expensive, and 12us is 24000 cycles on a 2GHz CPU. cmpxchg costs ~20 cycles (https://stackoverflow.com/questions/4187914/average-latency-...). PS. And you don’t need to submit/receive packets to NIC one by one, either; those things support DMA scatter/gather. |
|