| HN Mirror



	by tedivm 2660 days ago
	Is it actually 3x faster, or does it just scale better? In other words, if I had a model that previously took three seconds to return a response, would this platform respond in one second?

2 comments

avin_regmi 2660 days ago

Sorry, I should've been clearer. Predictions for both TF Serving and Panini serving were run in a single thread on a machine of the same specification. We used a simple image-classification model on the CIFAR dataset. Roughly 500 predictions were made for Panini and 200 for TF Serving. The graph on the website shows throughput. I'm planning to write a Medium post about the benchmark soon.
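
For readers following the latency-versus-throughput question, here is a minimal sketch of how both numbers could be measured in a single thread. It is not the benchmark described above: `benchmark` and `fake_predict` are hypothetical names, and `fake_predict` is a stand-in for a real model call, not the Panini or TF Serving API.

```python
import time
from statistics import mean


def benchmark(predict, inputs):
    """Call predict() on each input sequentially in one thread.

    Returns (mean latency in seconds, throughput in predictions per second).
    """
    inputs = list(inputs)
    latencies = []
    start = time.perf_counter()
    for x in inputs:
        t0 = time.perf_counter()
        predict(x)
        latencies.append(time.perf_counter() - t0)
    elapsed = time.perf_counter() - start
    return mean(latencies), len(inputs) / elapsed


def fake_predict(x):
    # Hypothetical stand-in for a model server call; sleeps to simulate work.
    time.sleep(0.001)
    return x % 10


if __name__ == "__main__":
    latency, throughput = benchmark(fake_predict, range(50))
    print(f"mean latency: {latency * 1000:.2f} ms")
    print(f"throughput:   {throughput:.1f} predictions/s")
```

With one thread and no batching, throughput is roughly the inverse of mean latency, so a throughput gain in that setup also implies a lower per-request latency. The two numbers can diverge once requests are batched or served concurrently.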


dang 2660 days ago

We changed the title from "Show HN: 3x Faster Than Tensorflow Serving" to what the page says, which is less baity.

https://news.ycombinator.com/newsguidelines.html


tedivm 2660 days ago

That's definitely an improvement, but I'm hoping someone from the Panini team will step in and clarify regardless.
