| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by phreeza 155 days ago
	But this is missing exactly the gap which OP seems to have, which is going from a next token predictor (a language model in the classical sense) to an instruction finetuned, RLHF-ed and "harnessed" tool?

1 comments

It will give you an answer to the extent anybody can.