Hacker News new | ask | show | jobs
by stymaar 16 days ago
That could be. Just use pre-training for language understanding and let the post-training on synthetic data do the heavy lifting.