by jxy
853 days ago
LLMs likely don't need proprietary data to train effectively. However, as long as the training data includes references to the NYT, misattribution issues may arise. We certainly need measures to prevent defamation by LLMs, or any text generators, and their creators. It's challenging to determine where to draw the line: from decryption tools that decipher random bits, to web browsers displaying text, to simple text editors, to n-gram Markov chain text generators, to shallow RNNs, to GPT-1, and beyond. Should we hold the tool creators or the tool users accountable for misuse? In my view, the worst outcome of the NYT winning the lawsuit wouldn't be OpenAI halting progress in generative text tools. The real concern is that OpenAI, with its resources, might find technological solutions to these issues, while startups and hobbyists with limited resources could be forced to stop operating entirely.
|
That's the best outcome in many more views than yours.