by keepamovin
63 days ago
Funny, I just made https://model-tracker.com because model performance changes all the time, and it would be good to have a subjective signal of what people are actually feeling today. Also, benchmarks are flaky af, as this paper shows. The idea is that knowing what to try first today saves a bit of time.
https://marginlab.ai/trackers/claude-code