Y
Hacker News
new
|
ask
|
show
|
jobs
by
AmanSwar
99 days ago
MetalRT is metal only inference engine (we are making for other hardwares too). Think of it like SGLang or vLLM but for single batch inference on apple silicon. See this blogpost :
https://www.runanywhere.ai/blog/metalrt-speech-fastest-stt-t...