|
|
|
|
|
by grog454
177 days ago
|
|
It's hard to have any certainty around concealment unless you are only testing local LLMs. As a matter of principle I assume the input and output of any query I run in a remote LLM is permanently public information (same with search queries). Will someone (or some system) see my query and think "we ought to improve this"? I have no idea since I don't work on these systems. In some instances involving random sampling... probably yes! This is the second reason I find the idea of publicly discussing secret benchmarks silly. |
|