|
|
|
|
|
by ricardobeat
319 days ago
|
|
This is meaningless without knowing which model, size, version and if they had access to search tools. Results and reliability vary wildly. In my case I can’t even remember last time Claude 3.7/4 has given me wrong info as it seems very intent on always doing a web search to verify. |
|
A not-so-subtle example from yesterday: Claude Code claiming to me yesterday assertion Foo was true, right after ingesting the logs with the “assertion Foo: false” in it.