by rubyskills
1084 days ago
This isn't quite accurate. GPT-3.5 has a 4k-token context window, and there is a 16k version.
GPT-4 has an 8k window, and there is a 32k version. You are correct that the limit has to cover both input and output. I suspect that when you feed ChatGPT longer prompts, it may try to use the 16k / 32k models when it makes sense.
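
To make the input-plus-output point concrete, here is a rough sketch of the arithmetic. The window sizes are the nominal figures quoted above written out as token counts; the labels and both helper functions are hypothetical names made up for illustration, and nothing here is meant to describe how ChatGPT actually picks a model.

```python
# Context window sizes in tokens, using the 4k/16k and 8k/32k figures above.
# The keys are illustrative labels (an assumption), not exact API model names.
CONTEXT_WINDOWS = {
    "gpt-3.5 (4k)": 4096,
    "gpt-3.5 (16k)": 16384,
    "gpt-4 (8k)": 8192,
    "gpt-4 (32k)": 32768,
}


def output_budget(window, prompt_tokens):
    """Tokens left for the reply once the prompt is counted against the window."""
    return max(0, window - prompt_tokens)


def smallest_window_that_fits(windows, prompt_tokens, reply_tokens):
    """Return the smallest window that holds prompt + reply, or None if none does."""
    needed = prompt_tokens + reply_tokens
    fitting = [w for w in windows if w >= needed]
    return min(fitting) if fitting else None


if __name__ == "__main__":
    # A 3,500-token prompt leaves little room for a reply in the 4k window.
    print(output_budget(CONTEXT_WINDOWS["gpt-3.5 (4k)"], 3500))
    # Wanting a 1,000-token reply as well pushes the request to the 16k window.
    print(smallest_window_that_fits([4096, 16384], 3500, 1000))
```

So a prompt that technically fits in the smaller window can still need the larger one as soon as you leave room for the answer.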