| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jean- 2344 days ago
	> When the state-of-the-art is 97% on a task, there's only so much room for improvement. When models commonly achieve 97% on a task, it means it's time to define a harder task, as it's long stopped providing any useful signal.

1 comments

drongoking 2344 days ago

I think when performance on a real world task can be expressed as a single percentage, we've over-simplified the hell out of it and it's time to rethink the problem.

link

Piskvorrr 2344 days ago

Or worse, overfitted - which means the solution will implode when faced with real data (RIP EH).

link