| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by rboyd 324 days ago

Great work! There should be a way for entities to crowdfund model training. Can a model like this be partially evaluated during training time and save through early stopping?

What are the best papers/resources on sota long-horizon RL?

Thanks.