Hacker News new | ask | show | jobs
by rpcope1 16 days ago
I am maybe positing something even stronger: say you had two prompts, both with the same information, one was written in the style of a good paper out of Nature or Science, one written in the style of a bad Twitter post or other kind of mess, even with the same information, I increasingly believe even for the top end models like Opus, the results are at least materially different if not grossly so. I really believe the stronger English input yields demonstrably better outcomes, even if the information contained inside is the same.
1 comments

Of course - remember it is not trying to answer your question or do what you ask, just trying to do what it was trained to do which is generate the statistically most likely continuation of your input. Your input sets the tone for the type of response that is statistically likely to follow. Idiots tend to engage with other idiots, and experts tend to seek out experts to talk to .. this is what it has been trained on, and is how it is selecting the best (most likely) “response” (continuation) to your input.
For many of these tools your prompt is a small part of the initial input, and it shrinks as a proportion of the total as more information is added to the context.

IME you can get great results from crappy prompts if the surrounding context is high quality, and you'll often get terrible results from great prompts if the surrounding context is low quality or non-existent. Same goes for the training corpus, I imagine.

I think success with LLMs is dominated by giving them a well-structured workspace, high-quality reference material, and strict tools with which they can check their work. Then you only need great prompts when venturing into parts unknown. But for that stuff you may as well write it by hand and get the LLM to fill in the gaps IMO.

True - and some may have been RLHF’d to give more rigorous/expert responses regardless of how the question/request is posed. Still, others responding here say that swearing at the model can be helpful (a signal to cut the BS I suppose), and an LLM wouldn’t be doing the job it was trained for if it’s response wasn’t highly attuned to the input