Hacker News new | ask | show | jobs
by drited 474 days ago
I would be curious about context window size that would be expected when generating ballpark 20 to 20 tokens per second using Deepseek-R1 Q4 on this hardware?