Hacker News new | ask | show | jobs
by deepsquirrelnet 152 days ago
This is really awesome detail. I’m very impressed by the amount of care taken to identify a good template. I started a small hook to try and do this using DSPy prompt optimizers, but haven’t had a compelling use case to try it with.

This seems like an ideal case for trying DFT as well. I’m not sure if you’re using trl, but I’d suggest checking that out.

1 comments

We're using an internal fork of trl for some of the steps.