by airgapstopgap 1070 days ago
No, it makes sense to secure engagement with the most expensive implementation and then cut costs; this kind of thing is pervasive in the industry. Besides, we have Brockman on record saying that they do "a lot of quantization"[1][2], so it's not paranoia to suspect other optimization schemes when there's a clear performance drop, a drop they have also denied a few times.

1. https://chat.openai.com/share/44a0c5b6-c629-470a-992f-8cdbbe...

2. https://www.youtube.com/watch?v=_hpuPi7YZX8

1 comment

Paranoia would be charitable: it's FUD.

If you intentionally smear the line between their web app, which is chock-full of optimizations just to let it function as it does (the web app's max conversation length exceeds the context window), and the API, which is versioned and iterated on in the open, it's either a lack of understanding or FUD.