Rust is a lot better than Python to run ML in production

Y	Hacker News new \| ask \| show \| jobs

	Rust is a lot better than Python to run ML in production (mectors.medium.com)
	5 points by mectors 545 days ago

1 comments

mectors 545 days ago

Python Docker images with CUDA, Python, Pytorch,... are 5GB to 10GB of third-party code. Here is an example of bringing this down to 1.13GB thanks to WasmEdge, 10MB of LlamaEdge API Server [compatible with ChatGPT] and 1.17GB for TinyLlama.

link

verdverm 545 days ago

If you are not actually running ML code, of course you can remove Python and ML libraries. You could get this down to MB by removing WASM and using a Go binary

apples and oranges

link