by valine 929 days ago
How much data do you need for UNA? Is a typical fine tuning dataset needed or can you get away with less than that?
2 comments

In addition to what was said, if it's anything like DPO you don't need a lot of data, just a good set. For instance, DPO requires a "good" and a "bad" response for each given prompt.
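To make the "good"/"bad" pairing concrete, here is a rough sketch of what a small DPO-style preference set might look like. The field names `prompt`, `chosen`, and `rejected` follow a common convention in DPO tooling and are an assumption here, as are the example records and the `is_valid_pair` helper; none of this is specific to UNA.

```python
# Hypothetical preference records: one "good" (chosen) and one "bad"
# (rejected) response per prompt. Field names are an assumed convention.
preference_data = [
    {
        "prompt": "Explain what a mutex is in one sentence.",
        "chosen": "A mutex is a lock that lets only one thread enter a critical section at a time.",
        "rejected": "A mutex is a kind of database.",
    },
    {
        "prompt": "What is 12 * 12?",
        "chosen": "12 * 12 = 144.",
        "rejected": "12 * 12 = 124.",
    },
]


def is_valid_pair(record):
    """Return True if the record has a non-empty prompt and two distinct responses."""
    keys = ("prompt", "chosen", "rejected")
    has_text = all(
        isinstance(record.get(k), str) and record[k].strip() for k in keys
    )
    return bool(has_text and record["chosen"] != record["rejected"])


# Keep only records that can actually serve as a preference pair.
usable = [r for r in preference_data if is_valid_pair(r)]
print(f"{len(usable)} usable preference pairs")
```

The point of the filter is quality over quantity: a pair where both responses are identical, or one is missing, carries no preference signal.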
It doesn't require much data; on a 7B model it can take roughly a couple of hours.
That’s cool. A couple of hours on a single GPU, or on something like 8x A100s?