|
|
|
|
|
by vikp
898 days ago
|
|
The size of the framework is not the most important factor - the model weights are usually 10x+ the size of the framework. The most important factor is inference speed. For something called Nitro, I really expected speed benchmarks. I'd be interested in CPU, CUDA, and MPS at different batch sizes. |
|