|
|
|
|
|
by pseudosavant
886 days ago
|
|
I thought so too. Could it be that gpt-4 turbo is more efficient for them to run, so the price is lower, but tries to maintain the token throughput of GPT4 over their API? There are a lot of ways they could allocate and configure their GPU resources so that GPT-4 Turbo provides the same per user throughput while greatly increasing their system throughput. |
|
Could the data have been collected when the system is under different loads?