Hacker News new | ask | show | jobs
by dontwearitout 508 days ago
How many tokens/s do you get on a 3090? With the extra tokens for the internal monologue, is it still performant enough for smooth VSCode integration?