|
|
|
|
|
by sanderjd
962 days ago
|
|
How does one efficiently learn how to do such things, and what kinds of problems such approaches are fruitful for? I find there to be a giant gap in learning about this stuff between material that boils down to "use magic words and system prompts to improve results from one of the big models" and "how do LLMs work from first principles". I still haven't found a great resource that covers this middle ground, which seems to me to be where a lot of the power of these approaches is going to reside. |
|
> Design your tasks to be repeatable and small steps, call the OpenAI API and log all requests/responses. > Filter out any bad responses and take a representative sample of the data you have collected from OpenAI,and train a Mistral or Llama2 model with the request/response pairs.
> Measure the quality of your model vs OpenAI for the same inputs, and then swap out the model in your workflow once happy with the results.