Hacker News new | ask | show | jobs
by francisduvivier 968 days ago
That GPU does pretty well for running an LLM though.

Llama 7B at 2.8 tok/s via Mlc-chat.

I'm setting up an LLM discord bot with it.