by sn0wr8ven 750 days ago
There are definitely smaller LLMs that can run on consumer computers, but as for their performance... you would be lucky to get a full sentence. On the other hand, sending prompts and receiving responses as text is probably the fastest and most realistic way to implement these things in games.
1 comment

I've gone past the 8k context window with very good text generation on Llama 3. I don't know what you're smoking.