| HN Mirror



	by kvetching 137 days ago
	I don't see the problem with this. The chatbot is the most important part of Grok, so it makes sense that Elon would be dogfooding it and then providing suggestions. He wants it to be truthful. Recent benchmarks showed that it hallucinates the least.

3 comments

Braxton1980 137 days ago

>He wants it to be truthful

How do you know this? Why would you believe him, considering the massive lies he's told, for example about widespread fraud in the 2020 election?


kvetching 137 days ago

https://artificialanalysis.ai/evaluations/omniscience?omnisc...

AA-Omniscience Hallucination Rate (lower is better) measures how often the model answers incorrectly when it should have refused or admitted to not knowing the answer. It is defined as the proportion of incorrect answers out of all non-correct responses, i.e. incorrect / (incorrect + partial answers + not attempted).
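To make that definition concrete, here is a minimal sketch of the ratio as quoted above. The function name, parameter names, and the example counts are made up for illustration and are not taken from the benchmark's own code or results; raising an error when there are no non-correct responses is also an assumption.

```python
def hallucination_rate(incorrect: int, partial: int, not_attempted: int) -> float:
    """Share of non-correct responses that were outright incorrect answers.

    Follows the quoted definition:
        incorrect / (incorrect + partial answers + not attempted)
    Correct answers do not appear in the formula at all.
    """
    non_correct = incorrect + partial + not_attempted
    if non_correct == 0:
        # Assumption: the rate is undefined when every response was correct.
        raise ValueError("no non-correct responses; rate is undefined")
    return incorrect / non_correct


# Hypothetical counts, for illustration only.
rate = hallucination_rate(incorrect=30, partial=10, not_attempted=60)
print(f"hallucination rate: {rate:.2f}")
```

Note that under this definition a model can lower its rate simply by declining to answer more often, since refusals grow the denominator but not the numerator.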

Grok 4.2, which was just released in the API, scored best on this benchmark.


SideQuark 137 days ago

Of all the valuable metrics on that site, all of which Grok does badly at except one, you managed to pick that single one.

https://artificialanalysis.ai/models


Braxton1980 136 days ago

This isn't a response to my question. I asked why you trust him.


SouthSeaDude 137 days ago

I totally agree. It's his company 100%. Why would you even apply for a job at a company where you don't agree with the owner or his vision?


jazzpush2 136 days ago

Do you think the investors of xAI want this behavior baked into the model? Do you think other frontier labs force their models to praise their CEO and never insult them?

And how does this fit into a vision, exactly? What vision might that be beyond "I am only to be praised"?


karmakurtisaani 137 days ago

Some of us have a pesky addiction to food and shelter.


etchalon 137 days ago

He wants it to tell the truth as he sees it.


timacles 137 days ago

Truth doesn’t have the right training weights for Elon.
