Hacker News new | ask | show | jobs
by jean- 2344 days ago
> When the state-of-the-art is 97% on a task, there's only so much room for improvement.

When models commonly achieve 97% on a task, it means it's time to define a harder task, as it's long stopped providing any useful signal.

1 comments

I think when performance on a real world task can be expressed as a single percentage, we've over-simplified the hell out of it and it's time to rethink the problem.
Or worse, overfitted - which means the solution will implode when faced with real data (RIP EH).