Y
Hacker News
new
|
ask
|
show
|
jobs
by
attemptone
376 days ago
We were talking about linear improvements and I have yet to see it
1 comments
mountainriver
376 days ago
check the benchmarks or make one of your own
link
attemptone
375 days ago
I checked the BlEU-Score and Perplexity of popular models and both have stagnated around 2021. As a disclaimer this was a cursory check and I didn't dive into the details of how individuals scores were evaluated.
link
mountainriver
375 days ago
on what benchmarks? pretty much every major one is linear improvement
link