Y
Hacker News
new
|
ask
|
show
|
jobs
by
kqr
642 days ago
It could be "probability of token being useful" rather than "probability of token coming next in training data"!