Hacker News new | ask | show | jobs
by saghm 44 days ago
I feel like you're making a logical leap here by assuming lossy and failure to reproduce in entirety implies inability to recognize. As a trivial example, I can take a sha256 hash of your comment here, lose the ability to reproduce it, but still have an extremely accurate ability to recognize whether some text is exactly your comment or not. Obviously hashing every substring would not be a particularly efficient strategy, but my point is that saying "it's lossy" isn't particularly compelling without other details.
1 comments

The example you've provided just adds noise.

sha256 is deterministic, LLMs are not, even at temperature set to 0.