| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by BoorishBears 1001 days ago

That is simply noise.

Surely you saw the sibling comment that tries to make the exact same point and did so hours ago, the reply is the same:

> There are toy examples of fine tuning in facts that are not of use outside of academic considerations at this point, and I sense it's contributing to the widespread confusion about fine-tuning's value proposition

The answer for someone asking that question is a strict no. Many people asking this stuff only have access to SFT, so it's a super no for them.

Honestly I don't get this weird obsession right now with LLMs and throwing random roadblocks in any sort of common knowledge of the subject. If someone in CS 101 asked if they could write a game engine in CSS you wouldn't get people lining up to tell them the answer isn't "No." despite it technically being possible (https://github.com/brookjordan/css-game-engine) because we understand that sometimes to enable understanding of a subject you need to setup some solid ground for new entrants to stand on.

Fine-tuning is not for knowledge. If you get comfortable enough to start experimenting with that application, you'll understand that there's some nuance to that statement either way and get to research/tinker/push boundaries armed with enough knowledge to not accept the simple no.

It's no different than teaching the Bohr model of the atom: we know it doesn't hold up to discoveries that you'll come across after it is established, but it doesn't matter because by the time you know enough to revisit the topic, you understand why the answer was a flat no then and can move past it on your own.

OP could have googled the topic but they asked human beings the question. They likely presumed they'd use their human sensibilities to understand the underlying intent of the question instead of parroting a list of toy experiments that would have zero benefit to them.

1 comments

zwaps 1000 days ago

Having literally done it in an enterprise setting (and participated in experiments for some of the largest companies in the world in their respective domain fields), I have to say: your lack of nuance and abundance of arrogance does not come across very well.

It is important to distinguish between something being impossible, infeasible and not well understood. Fine-tuning "for effect" is mostly the latter.

You say "current fine-tuning techniques can only contribute to knowledge indirectly" and then in the next post row back to "except in toy examples" because the former is - literally - not correct.

This is HN. We are not advising clients on how "to get their data into their AI best". We can discuss here the actual technical detail of a thing. An intellectually honest discussion begins with saying: "From a scientific standpoint, and even from a practical standpoint, we are not sure yet, however..."

link

BoorishBears 1000 days ago

"advising clients" is such an odd way of describing "making a complex topic approachable"

But you're correct, this is HN: so much pontificating without producing a single counterfactual implies you should speak for yourself and not the collective.

They said "LLM", but given the context it's an RLHF LLM, and presumably they want a generalized way to add factual information in a way that doesn't cripple the model's general performance (yes, I am being so arrogant as to draw obvious conclusions to give them a useful answer)

No paper on the subject has achieved this, the ones that come close (and by close I mean very far) fall back to BERT sized models which I already addressed below: so please petition your "enterprise" to share their secrets

(wrong crowd to get any gravitas out of the word enterprise btw, we understand it means "constrained usecase with minimal external validation")

link