|
|
|
|
|
by zw123456
566 days ago
|
|
The Tower of Babel was a library that contained every possible combination of letters to form a 400 page book. Or something like that. It made me wonder, what if you made a content honey pot full of just random text and a chatbot vacuumed that up? Does it's data vacuum have a garbage detector? |
|
However, it has become clear that effective LLM training is in large matter a matter of careful curation of high quality training data. Random gibberish is trivially detectable, by LLMs themselves if nothing else, so it's unlikely that your "honeypot" will ever make it into someone's training run.
Even if you carefully crafted some more subtle poison data, it would still form only a small amount of the training set. The worst case scenario is most likely that the LLM learns to recognize your particular style of poison, and will happily recreate it if prompted appropriately (while otherwise remaining unaffected); more likely, your poison data is simply swamped.