| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bootsmann 1073 days ago
	Pytorch is an animal by itself when you try to put it into production. They have started addressing it with torch 2.0 but it still has lengths to go. With this you can switch to TFserve if you have usual architecture.

1 comments

cygn 1073 days ago

You can just use Triton which is basically TFserve for Tensorflow, Pytorch, Onnx and more.

link

albertzeyer 1073 days ago

Can you explain that?

My understand of Triton is more that this is an alternative to CUDA, but instead you write it directly in Python, and on a slightly higher-level, and it does a lot of optimizations automatically. So basically: Python -> Triton-IR -> LLVM-IR -> PTX.

https://openai.com/research/triton

link

chillee 1073 days ago

It's confusing, there's OpenAI Triton (what you're thinking of) and Nvidia Triton server (a different thing).

link

jerrygenser 1073 days ago

Original comment is referring to Nvidia triton inference server

link