|
|
|
|
|
by otabdeveloper4
62 days ago
|
|
Fundamentally they're the same technology with the same exact algorithms under the hood; only the post-training alignment differs. That is, the difference you see is either placebo effect or you being lucky and better aligning with model post-training bias. |
|