Y
Hacker News
new
|
ask
|
show
|
jobs
LLMs: Solvers vs. Judges
(
bensantora.com
)
2 points
by
truelinux1
103 days ago
1 comments
truelinux1
103 days ago
I gave several LLMs a logic puzzle with an embedded contradiction. Some flagged it. Some quietly bent the rules to produce an answer anyway.
Knowing which type of model you're using (a helpful solver or a strict judge) really matters.
link
Knowing which type of model you're using (a helpful solver or a strict judge) really matters.