Hacker News new | ask | show | jobs
by _QrE 399 days ago
How can you call this 'Absolute Zero' if you need to start with a pretrained LLM? From what I understand, this just proposes that you can take an existing LLM, have it generate tasks and solve the tasks, and have it learn from that. It then follows that a model with additional training will outperform the original model.

I'm assuming that I'm misunderstanding something, because this doesn't seem very novel?

Edit: Seems like a variant of adversarial training?

1 comments

if you could improve the LLM without any further data, it would count as absolute zero. I'm highly skeptical however personally.