|
|
|
|
|
by johnnyfived
162 days ago
|
|
There'd first have to be an intense evaluation and standardization process for AI / measuring AGI now. All current benchmarks are tailored to one use case (e.g. SWE) or are evaluations that can be gamed and manipulated. I think this would take the form of something more abstract instead of concrete with raw numbers, like a revised Turing Test. |
|