Hacker News new | ask | show | jobs
by crooked-v 620 days ago
The children's-story pattern, complete with convenient moral lessons at the end, is so aggressive with both ChatGPT and Claude that I suspect both companies have RLHFed it that way to try and keep people from easily using it to produce either porn or Kindle Unlimited slop.

For a contrast, look at NovelAI. They only use (increasingly custom) Llama-derived models, but their service outputs much more narratively interesting (if not necessarily long-term coherent) text and will generally try and hit the beats of whatever genre or style you tell it. Extrapolate that out to the compute power of the big players and I think you'd get something much more like the Star Trek holodeck method of producing a serviceable (though not at all original) story.

2 comments

The holodeck method still requires lots of detail from the creator, it just extrapolates the sensory details from its database like ChatGpt does with language and fills out the story.

For example, when someone wanted a holonovel with Kiera Nerys, Quark had to scan her to create it so when using specific people they have to get concrete data as opposed to historical characters that were generated. Likewise, Tom Paris gave the computer lots of “parameters” as they called them to create the stories like the Adventures of Captain Proton and based on dialog he knew how the stories were supposed to play out on all his creations, if not how they ended each run through.

The creative details and turns of the story still need to come from the human.

In a made up story about a utopian future, and for now in our current reality, that is. There was also that episode where the holodeck created sentience and they put it in a box to explore a generated universe because it was too dangerous to let out into the real world. There's plenty of scifi predictions about the future of humanity, Star Trek's utopian future where humans are unique and necessary is not the only one, there are plenty of dystopian ones too.
>RLHFed

For those of us not steeped in AI culture, this appears to be short for "Reinforcement learning from human feedback".