by deepsquirrelnet
240 days ago
Absolutely the first thing you should try is a prompt optimizer. The GEPA optimizer (implemented in DSPy) often outperforms GRPO training [1]. But I think people are usually building with frameworks that aren't machine learning frameworks.

[1] https://arxiv.org/abs/2507.19457