Hacker News new | ask | show | jobs
by zozbot234 3 hours ago
DeepSeek V4 (both Flash and Pro) has very good scaling of context length wrt. RAM use, so this is not an inherent limit of LLMs in general.