by pwendell 1177 days ago
Yes, this was a very surprising result: a relatively small amount of uptraining was able to unlock so much latent knowledge in the model.