|
|
|
|
|
by SchemaLoad
28 days ago
|
|
Except 99% of the time they are asking it's because they explicitly need a real opinion or the info couldn't be found via LLMs. But instead of giving an "I don't know", they paste back an wall of text with an incorrect answer that the sender hasn't even read or verified to be true. At least with "I don't know" the asker can move on to someone who might know faster. |
|
Different reward function, but the same behaviour emerges.