|
|
|
|
|
by drej
2059 days ago
|
|
As for traditional TensorFlow, the easiest way we found to improve performance (easily 2x) was to find/create builds tailored to our machines. Using Python, we had prebuilt wheels, which have (understandably) low feature requirements. If you find/build your own (e.g. if you have AVX-512), you can easily get pretty detect performance gains. (Yes, there are unofficial wheels for various CPUs, but, not sure if that passes your security requirements.) |
|