|
|
|
|
|
by adamisom
201 days ago
|
|
Personally, I don't understand how LLMs work. I know some ML math and certainly could learn, and probably will, soon. But my opinions about what LLMs can do are based on... what LLMs can do. What I can see them doing. With my eyes. The right answer to the question "What can LLMs do?" is... looking... at what LLMs can do. |
|
You should be doubly skeptically ever since RLHF has become standard as the model has literally been optimized to give you answers you find most pleasing.
The best way to measure of course is with evaluations, and I have done professional LLM model evaluation work for about 2 years. I've seen (and written) tons of evals and they both impress me and inform my skepticism about the limitations of LLMs. I've also seen countless times where people are convinced "with their eyes" they've found a prompt trick that improves the results, only to be shown that this doesn't pan out when run on a full eval suite.
As an aside: What's fascinating is that it seems our visual system is much more skeptical, an eyeball being slightly off created by a diffusion model will immediately set off alarms where enough clever word play from an LLM will make us drop our guard.
0. https://en.wikipedia.org/wiki/ELIZA_effect