Hacker News new | ask | show | jobs
by kqr 642 days ago
It could be "probability of token being useful" rather than "probability of token coming next in training data"!