Hacker News new | ask | show | jobs
by slashdave 2 hours ago
Except these models are not run prompt-to-prompt. The infra has to hold the entire context.