|
|
|
|
|
by qumpis
1158 days ago
|
|
I wonder why it's slower at inference time then (for members using their web UI), or rather, if it's similar in size to gpt3, how gpt3 is optimized in a way that gpt4 isn't or can't be? I'd expect that by now we would enjoy similar speeds but this hasn't yet happened. |
|