by hanoz
1135 days ago
The more token capacity that's added, the more wasteful it seems to have to use this statelessly. Is there any avoiding this? Wondrous as this new tech is, it seems a bit much to be paying $2 a question in a conversation about a 32k-token text.
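For a rough sense of where the $2 comes from, here is a back-of-the-envelope sketch. The rate of $0.06 per 1,000 prompt tokens is an assumption inferred from the $2 figure, not a quoted price, and the function names are made up for illustration. It only counts prompt tokens and ignores the cost of the answers.

```python
def prompt_cost(prompt_tokens: int, price_per_1k: float) -> float:
    """Cost in dollars of sending one prompt of the given size."""
    return prompt_tokens / 1000 * price_per_1k


def stateless_conversation_cost(
    document_tokens: int,
    question_tokens: int,
    questions: int,
    price_per_1k: float,
) -> float:
    """Total prompt cost when every question resends the whole document.

    Nothing is remembered between calls, so the document is paid for
    again on each question.
    """
    per_question = prompt_cost(document_tokens + question_tokens, price_per_1k)
    return per_question * questions


# Assumed rate: $0.06 per 1,000 prompt tokens (hypothetical, see above).
ASSUMED_PRICE_PER_1K = 0.06

one_question = prompt_cost(32_000, ASSUMED_PRICE_PER_1K)
ten_questions = stateless_conversation_cost(32_000, 0, 10, ASSUMED_PRICE_PER_1K)
print(f"one question: ${one_question:.2f}")
print(f"ten questions: ${ten_questions:.2f}")
```

Under that assumed rate the cost grows linearly with the number of questions, since the 32k-token text is billed in full every time.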