| HN Mirror



	by marcopicentini 1025 days ago
	What other use cases can be handled only by an LLM? Number sorting is faster using code.

2 comments

empath-nirvana 1025 days ago

The point of using number sorting for this paper is that it's

A) difficult to impossible for an LLM to do in a single pass, and B) easy to verify for correctness.

In general, the point isn't finding things that only an LLM can do, but finding things that LLMs can do with decent results at lower cost than getting a human to do it.
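
To make the "easy to verify" point concrete, here is a minimal sketch of a checker for a model's sorted output. It assumes the input and the model's answer have already been parsed into Python lists of numbers; the function name `is_valid_sort` is made up for illustration and is not taken from the paper.

```python
from collections import Counter


def is_valid_sort(original, candidate):
    """Return True if `candidate` is a correctly sorted version of `original`.

    Hypothetical helper: checks two things that together define a correct sort.
    """
    # 1. Same multiset of elements: nothing dropped, duplicated, or invented.
    same_elements = Counter(original) == Counter(candidate)
    # 2. Non-decreasing order: every element is <= the one after it.
    in_order = all(a <= b for a, b in zip(candidate, candidate[1:]))
    return same_elements and in_order
```

Both checks are cheap, so a wrong answer (a dropped duplicate, a swapped pair) is caught without a human having to read the list.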


jbay808 1025 days ago

It is only difficult for an LLM to sort a list of numbers if the list is longer than half of the context window. (Source: I tested this myself[1]). The sorts are not error-free every time, but with sufficient training they become error-free the vast majority of the time, even for long lists. This is not especially surprising because transformers are capable of directly representing sorting programs.[2]

[1] https://jbconsulting.substack.com/p/its-not-just-statistics-...

[2] https://arxiv.org/abs/2106.06981


empath-nirvana 1024 days ago

Of course you can train a neural network to sort numbers, but I'm talking about a general LLM which hasn't been trained to sort numbers specifically. A GPT network trained to sort numbers is not what I would consider to be a Large Language Model.


creer 1025 days ago

I don't think efficiency is important at this point. Finding that it's possible "this way" opens the door for more work and more applications. (Which doesn't prevent others from already working on efficiency.)
