Hacker News new | ask | show | jobs
by TheKelsbee 420 days ago
I have this same thought, and have tried similar approaches.

OP: Have you trained or fine tuned a model that specifically reasons the worker model inputs against the user input? Or is this basically just taking a model and turning the temperature down to near 0?

1 comments

Low temperature, heavy prompting to answer in a structured way. Sadly can't fine train models since this is API based but the approach does work!