|
|
|
|
|
by notavalleyman
458 days ago
|
|
No, they generally do not compete on accuracy benchmarks afaik. GitHub/openai/simple-evals is what I checked here, and no, openai do not compete on accuracy benchmarks as far as I can tell. So I'd be interested in seeing what led you to think that, and also what led you to earlier claim that anyone typing in the complainant's name saw the same hallucination. |
|
"Get Answers" is literally at the top of ChatGPTs landing page. You think the average person interprets that to mean "Get inaccurate answers"?
Google "AI benchmark" and almost every result is an assessment of the accuracy of various models. What do you think they compete on? How do you think they measure the improvement of one model to the next?
Here's OpenAI's "Optimizing LLM Accuracy" https://platform.openai.com/docs/guides/optimizing-llm-accur...
Pop this in Google and see the pages of results about accuracy: site:openai.com "accuracy". To claim that they don't optimize for accuracy confirms to me that you are not discussing this in good faith. Perhaps you are just trying to be contrarian or something, I don't know.
>and also what led you to earlier claim that anyone typing in the complainant's name saw the same hallucination.
Well, it says right in the article that different people received the same result.
Why are the goalposts moving? Actually, nevermind, I don't care to continue the conversation.