Hacker News new | ask | show | jobs
by elfelf12 622 days ago
Is it a copyright problem or a capitalist problem or why do we only get nerfed dumb chatbots?

Would be interesting to really try hard and create a llm that can write novels in the style of an author. And skip the chat functionality!

4 comments

I believe this is neither. I believe this is purely a form of control - not to make money later or lose less money - rather, I believe many are very afraid of how people would use an un-nerfed LLM.

However, it's inevitable.

Perhaps both. But I wonder if the incredible blandness of most chatbots is effectively just a regression towards the mean.

Most AI companies try to train their bots on vast amounts of different data, and I suspect it's very difficult for that to result in very creative writing when you're training on works of fiction, as well as cooking recipes, Reddit comments and technical documentation.

Copying writers is probably a copyright thing. But the experience with generative AI for images was that, at least for the early models, it was good to put things like "masterpiece, highest quality" in the prompt. The model biased towards average rather than trying to maintain a high standard. The more general problem here could easily be that people haven't figured out how to prompt interesting writing from an LLM yet.

Although my personal theory would be that LLMs are just writing how someone without an ego or firsthand knowledge would write - it has a bunch of different angles it could take but has no particular reference to draw on to determine which is true. Great human writers are often cataloguing their extra-literary experiences. How is ChatGPT supposed to be inspired by a beautiful sunset to capture it in a way that has never been done before? It is capable of the writing part, but the inspiration part is a lot harder for it.

The notion that generating something "in the style of" a human creator is a violation of copyright is categorically false. Copyright is only infringed when a work is substantially copied. Generating something new but with a similar feel is fair game. That OUGHT to be (but, obnoxiously, seems not to be) universally uncontroversial.

Human creators might bristle and find it distasteful to have works automatically generated in a style they spent a long time honing, but it is most certainly not a violation of copyright.

There is nothing between the lines.
LLM can continue anything, chat is simply what worked best so far. Outputs being bland and soulless, and lacking in global structure, if I may add, is just architectural. There's nothing behind that.