Hacker News
by tedivm 2660 days ago
Is it actually 3x faster, or does it just scale more?

In other words, if I had a model that previously took three seconds to return a response, would this platform respond in one second?

2 comments

Sorry, I should've been clearer. Predictions for both TF Serving and Panini serving were made in a single thread on a machine of the same specification. We used a simple image classification model trained on the CIFAR dataset. Roughly 500 predictions were made for Panini and 200 for TF Serving. The graph on the website shows throughput. I'm planning to write a Medium post about the benchmark soon.
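For anyone who wants to separate the two numbers being discussed, here is a minimal sketch of a single-threaded benchmark that reports both throughput and per-request latency. It is not the Panini benchmark code: `benchmark` and `fake_predict` are hypothetical names, and `fake_predict` is just a stand-in for whatever call actually hits the serving endpoint.

```python
import time
from typing import Any, Callable, Dict, Sequence


def benchmark(predict: Callable[[Any], Any], inputs: Sequence[Any]) -> Dict[str, float]:
    """Send inputs one at a time from a single thread and time each call."""
    if not inputs:
        raise ValueError("inputs must not be empty")

    latencies = []
    start = time.perf_counter()
    for x in inputs:
        t0 = time.perf_counter()
        predict(x)  # assumption: a blocking call that returns one prediction
        latencies.append(time.perf_counter() - t0)
    elapsed = time.perf_counter() - start

    return {
        "throughput_per_s": len(inputs) / elapsed,            # predictions per second
        "mean_latency_s": sum(latencies) / len(latencies),    # seconds per prediction
    }


def fake_predict(x: Any) -> int:
    """Hypothetical stand-in for a real model-serving request."""
    time.sleep(0.001)
    return 0


if __name__ == "__main__":
    print(benchmark(fake_predict, list(range(50))))
```

In a strictly sequential loop like this one, throughput is roughly the reciprocal of mean latency, so a throughput gain would also show up as faster individual responses. The two can diverge once requests are batched or served concurrently, which is why it helps to report both.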
We changed the title from "Show HN: 3x Faster Than Tensorflow Serving" to what the page says, which is less baity.

https://news.ycombinator.com/newsguidelines.html

That's definitely an improvement, but I'm hoping someone from the Panini team will step in and clarify regardless.