|
|
|
|
|
by 37ef_ced3
1394 days ago
|
|
For small-scale transformer CPU inference you can use, e.g., Fabrice Bellard's https://bellard.org/libnc/ Similarly, for small-scale convolutional CPU inference, where you only need to do maybe 20 ResNet-50 (batch size 1) per second per CPU (cloud CPUs cost $0.015 per hour) you can use inference engines designed for this purpose, e.g., https://NN-512.com You can expect about 2x the performance of TensorFlow or PyTorch. |
|