by throwawayadvsec 1107 days ago
extremely unlikely

LLM tokens are usually common words or parts of words, and it would be extremely weird for Copilot to output real keys verbatim in generated code (I've actually tried a few times). If anything, it would produce random invalid keys, since there are no real patterns in API keys.

Plus, I'd be shocked if keys weren't automatically stripped from the training data.
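
To make the "stripped from the training data" idea concrete, here is a rough sketch of what such a scrubbing pass could look like. This is purely an illustration, not how Copilot's training pipeline actually works; the pattern, the `redact_secrets` name, and the 16-character cutoff are all my own assumptions.

```python
import re

# Hypothetical heuristic (an assumption, not any vendor's real filter):
# a name containing "api_key", "secret" or "token", assigned a quoted
# string of 16 or more key-like characters.
SECRET_ASSIGNMENT = re.compile(
    r"""
    (\b\w*(?:api_?key|secret|token)\w*\s*[:=]\s*)  # group 1: name and assignment
    (["'])[A-Za-z0-9_\-]{16,}\2                    # group 2: quote, then the candidate secret
    """,
    re.IGNORECASE | re.VERBOSE,
)


def redact_secrets(source: str) -> str:
    """Replace anything that looks like a hard-coded secret with a marker."""
    return SECRET_ASSIGNMENT.sub(r'\1"<REDACTED>"', source)


if __name__ == "__main__":
    print(redact_secrets('API_KEY = "abcd1234efgh5678ijkl"'))
    print(redact_secrets('name = "bob"'))
```

A real filter would need far more patterns and would still miss things, which is why the edge cases matter.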

1 comment

I’m not sure how it’s implemented, but when Copilot suggests code with an inline API key or similar, it seems to reliably generate an obviously sequential alphanumeric string that is distinguishable at a glance from real data.

I’m sure there are edge cases, but I’ve been surprised how well it handles this.
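
For what it's worth, the "discernible at a glance" part is easy to approximate in code. A minimal sketch, assuming a placeholder is a string where most adjacent characters count upward by one (like `abcdef123456`); the function names and the 0.5 threshold are made up for illustration.

```python
def sequential_fraction(s: str) -> float:
    """Fraction of adjacent character pairs where the second is exactly one code point after the first."""
    if len(s) < 2:
        return 0.0
    steps = sum(1 for a, b in zip(s, s[1:]) if ord(b) - ord(a) == 1)
    return steps / (len(s) - 1)


def looks_like_placeholder(s: str, threshold: float = 0.5) -> bool:
    """Hypothetical check: mostly sequential strings are treated as placeholders."""
    return sequential_fraction(s) >= threshold


if __name__ == "__main__":
    print(looks_like_placeholder("abcdef123456"))
    print(looks_like_placeholder("x9Kq2LmZ7pRt"))
```

Real keys are close to random, so they should almost never score high on a check like this.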