HN Mirror



	by fooker 487 days ago
	Larger LLMs do pretty well with this. Smaller ones don't.

1 comments

sebzim4500 487 days ago

Large ones do better than small ones, but they still do worse than I would have expected before I tested them. E.g. `o1` doesn't know things that are repeated several times on Wikipedia.


fooker 487 days ago

o1 is not that large, and its emphasis is on reasoning rather than memorization.

Try the largest Llama models, and phrase your prompt as a sentence to be completed rather than as a question.
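
To make the two phrasings concrete, here is a rough sketch. The helper names and the example fact are made up for illustration, and nothing here calls a model; it only builds the prompt strings you would paste into one.

```python
def question_prompt(subject: str, attribute: str) -> str:
    """Chat-style phrasing: asks the model a direct question."""
    return f"What is the {attribute} of {subject}?"


def completion_prompt(subject: str, attribute: str) -> str:
    """Completion-style phrasing: an unfinished sentence for the model to continue."""
    return f"The {attribute} of {subject} is"


if __name__ == "__main__":
    # Hypothetical example fact, chosen only to show the two shapes side by side.
    print(question_prompt("France", "capital"))
    print(completion_prompt("France", "capital"))
```

The second form is the one I mean: the model just has to continue the sentence, which is closer to what a base model was trained to do.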
