| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by francisduvivier 968 days ago

That GPU does pretty well for running an LLM though.

Llama 7B at 2.8 tok/s via Mlc-chat.

I'm setting up an LLM discord bot with it.