Hacker News new | ask | show | jobs
by Havoc 954 days ago
Is it know what exactly OpenAI does in the background when they make these turbo editions?

Seems like sacrificing some quality for large gains on speed and cost but anyone know more detail?

1 comments

Don't think so, but there were some guesses on 3.5-turbo-- i.e. training a much smaller model on quality questions/answers from GPT-4. Same tactic worked again and again for other LLMs.

I'm definitely curious on the context window increase-- I'm having a hard time telling if it's 'real' vs a fast specially trained summarization prework step. That being said, it's been doing a rather solid job not losing info in that context window in my minor anecdotal use cases.