|
|
|
|
|
by ComputerGuru
59 days ago
|
|
Seems to be llm written article and the tooling around the model is undeniably influenced by knowledge of the tests. In all cases, GPT 3.5 isn’t a good benchmark for most serious uses and was considered to be pretty stupid, though I understand that isn’t the point of the article. |
|