| HN Mirror



	by fblgit 930 days ago
	UNA: Uniform Neural Alignment. Haven't you noticed yet? Each model that I apply UNA to behaves like a pre-trained model, and you can likely fine-tune it again without damaging it. If you've chatted with them, you know that strange sensation, and you know what it is: intelligence. Xaberius-34B is the highest performer on the board, and it is NOT contaminated.

1 comments

valine 929 days ago

How much data do you need for UNA? Is a typical fine tuning dataset needed or can you get away with less than that?


brucethemoose2 929 days ago

In addition to what was said, if it's anything like DPO, you don't need a lot of data, just a good set. For instance, DPO requires a "good" and a "bad" response for each given prompt.
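
To make the "good"/"bad" pairing concrete, here is a minimal sketch of what one preference record might look like. The class name `PreferencePair`, the field names `prompt`/`chosen`/`rejected`, and the `validate` helper are all illustrative assumptions, not anything UNA or a particular library is confirmed to use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PreferencePair:
    """One DPO-style training record (field names are illustrative assumptions)."""
    prompt: str
    chosen: str    # the "good" response
    rejected: str  # the "bad" response


def validate(pairs):
    """Keep only usable pairs: non-empty fields and two distinct responses."""
    return [
        p for p in pairs
        if p.prompt.strip() and p.chosen.strip() and p.rejected.strip()
        and p.chosen != p.rejected
    ]


pairs = [
    PreferencePair("What is 2 + 2?", "2 + 2 = 4.", "2 + 2 = 5."),
    # Identical responses carry no preference signal, so this one is dropped.
    PreferencePair("Name a prime number.", "7 is prime.", "7 is prime."),
]
usable = validate(pairs)
print(len(usable))
```

The point of the filter is the "just a good set" part: a small number of clean, clearly contrasted pairs is the kind of data being described.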


fblgit 929 days ago

It doesn't require much data; on a 7B model it can take a couple of hours.


valine 929 days ago

That's cool. A couple of hours on a single GPU, or on something like 8x A100s?
