|
|
|
|
|
by llm_trw
477 days ago
|
|
A dataset of labelled answers that are hallucinations and not hallucinations are published based on the benchmark as part of a paper. People _seriously_ underestimate just how much stuff is online and how much impact it can have on training. |
|