|
|
|
|
|
by lukax
239 days ago
|
|
> It’s powered by a version of GPT‑5 that’s trained to look across multiple sources to give more comprehensive and accurate answers. So another GPT-5 fine-tune. Codex also uses a custom GPT-5 fine-tune. Does fine-tuning make sense now? Or do you have to be OpenAI to fine-tune the models with a mix of existing data and new behaviours? |
|
Look at Tinker for an example of where things might be heading though (https://tinker-docs.thinkingmachines.ai/)
At present though, I get the sense that reinforcement learning at scale is the current battleground (and has been for most of 2025). But we also see over time, the general models adopt the skills taught to the specialized models. Look at how the learning that made codex-1 went into GPT5.