Hacker News new | ask | show | jobs
by valine 1189 days ago
The token limit is the problem, in general token limits can’t be changed after the model has been trained. Gpt4 has an exceptionally large 32k token limit, but even with 32k tokens you’d only get a few weeks of chat before the context window was full.

Not to mention the added cost of using the full 32k tokens. OpenAI is charging $0.12 a token which would quickly add up. It’s prohibitively expensive unless you have a very very compelling business use case.

1 comments

Maybe trim chat history to most important content?