|
|
|
|
|
by jychang
35 days ago
|
|
That would be REALLY easy to detect. It'll be 4x slower. The tokens/sec of the model is basically directly proportional of the memory bandwidth of the hardware it runs on. So either OpenAI has to gimp model performance for its entire life, or somehow magically speed it up 4x on the first day. |
|