Hacker News new | ask | show | jobs
by CamperBob2 3 days ago
How do I get that loss, though, without the softmax inputs?
1 comments

Do they have logits for all of the Wikipedia etc that they've scraped?