Y
Hacker News
new
|
ask
|
show
|
jobs
by
akoboldfrying
2 hours ago
Only read the first section but this sounds really impressive -- up to 50% of up to 17% of training time when using the Muon optimiser, so up to around 7% of basically pure improvement with no downside.