|
|
|
|
|
by kcorbitt
403 days ago
|
|
It's very unlikely that they're doing their own pre-training, which is the longest and most expensive part of creating a frontier model (if they were, they'd likely brag about it). Most likely they built this as a post-train of an open model that is already strong on coding like Qwen 2.5. |
|