| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jrmg 1167 days ago
	Is it now impossible to train another LLM on web input? The genie is out of the bottle--you can spam prompts into anything (webforms, html, etc) and compromise new LLMs. The only reason openAI could do it with chatGPT is that people hadn't realized it yet and spammed the input data with prompts? Wasn't that training the last "clean" dataset? Pre-2023 web crawls will be the low-background steel of future LLM training.

2 comments

TisButMe 1167 days ago

(Author here) that's what I thought originally, but then it means that LLMs never get to learn from new content - current ones stop in 2021, they don't know that Russia invades Ukraine, or that Arc is a cool browser or the API of any libraries released after their end date (which has been an issue for me for code generation using fast moving libraries). I don't think it's good enough to stop acquiring new content.

link

mdale 1167 days ago

There is nothing to prevent a robust hierarchy of rules and training that impacts levels of permissions per operator intent.

OpenAi has made a lot of progress on this in a very short amount of time. Casual jailbreaking or negative role playing is already 100x more difficult then early versions via the ChatGPT chat interface.

We will see more sophisticated robust adversarial filters to untrusted content going forward.

link

TisButMe 1167 days ago

Possibly yes - I think that's my point with predicting peak oil wrong for 50 years. Still, right now it seems every time OpenAI/someone else adds a new content filter, someone figures out a prompt escape that works.

link

tough 1167 days ago

phind gpt4 enabled search fixes the new content bias

link

ericb 1167 days ago

That's a great metaphor!

edit: I predict the internet archive will no longer have funding challenges.

link