Hacker News new | ask | show | jobs
by lambdaba 486 days ago
> It does seem like Google is quick to respond to feedback.

I'm puzzled as to how that would work, when people talk about quick changes in model behavior. What exactly is being adjusted? The model has already been trained. I would think it's just randomness.

2 comments

Magic

And fine tuning.

Choose your fighter...

High level overview: https://www.datacamp.com/tutorial/fine-tuning-large-language...

More detail: https://www.turing.com/resources/finetuning-large-language-m...

Nice charts: https://blogs.oracle.com/ai-and-datascience/post/finetuning-...

The big platforms also seem to employ an intermediate step where they rewrite your prompt. I've downloaded my ChatGPT data and found substantial changes from what I wrote. Usually for the better. Changes to the way it rewrites changes the results.

System prompts have a huge impact on output. Prompts for ChatGPT/etc are around a thousand words, with examples of what to do and what not to do. Minor adjustments there can make a big difference.