Hacker News new | ask | show | jobs
by daemin 558 days ago
I would imagine the context takes up valuable input tokens that you would otherwise need to use for your request. So you'll run out at some point and then you just have a simple model rather than a skilled engineer.
1 comments

Can't you just cache the context?