by morsecodist
391 days ago
I think it actually makes sense to trust your vibes more than benchmarks. Creating the benchmark is the hard part: if we had a perfect benchmark, AI problems would be trivially solvable. Benchmarks are meaningless on their own; they are supposed to be a proxy for actual usefulness. I'm not sure what's a better test than "can it do what I want?" And for me, the ratio of yes to no on that hasn't changed much.