| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by vladf 1167 days ago
	How does this technique differ from the supernet optimization for one-shot NAS? https://proceedings.mlr.press/v80/bender18a.html It seems like they use a fixed-distribution controller for training. It’d be nice to see why it’s worth deviating from the original RL paradigm.

1 comments

whimsicalism 1167 days ago

It's very different, but hard to distill in a comment. They use a new regularization technique to basically create a LoRA with dynamically adjustable rank.

link