|
|
|
|
|
by jzwinck
2179 days ago
|
|
Why do you consider computations to be opaque? Do you not have the source code? There is a ton of low hanging speed in many computations that people treat as black boxes. Often as the result of knowing something extra about the specific input data rather than relying on a generic implementation. In some cases all you need is to write NumPy code instead of Pandas code for a 2-3x speedup. Then suddenly your small cluster program runs on one machine. |
|
But for the latter, see discussion on shifting the pd compute to a RDBMS elsewhere in these comments.