Y
Hacker News
new
|
ask
|
show
|
jobs
by
CuriouslyC
166 days ago
Pretty interesting. The posterior matching is a big deal, but I'm not convinced by the handwaiving required to demonstrate it in larger models. I'm interested in seeing how direct EM training scales though.