|
|
|
|
|
by Workaccount2
513 days ago
|
|
There is a frustrating gap between benchmarks and real world ability. O1 or even O3 might be able to crack academic level math problems, but I still wouldn't trust it to correctly fill out a McDonalds application using a PDF of my resume and a calendar of my availability. |
|