Hacker News new | ask | show | jobs
by ben_w 1088 days ago
I doubt the frustration/swearing combo is going to be of great significance.

Yes, its insistence on verbosity gets to me too, even though (as I understand it) that verbosity is the only place it has for any extra "deep thinking" about stuff and thus actually necessary for improved performance.

But it was trained in the first place on a (filtered) form of common crawl, so it probably already had all that.

"The AI swearing" is easy mode for alignment, both because it is low-damage and the availability of trivial filter-based solutions, so it only matters to the larger alignment problem in so far as it's a warning sign we still don't know what we're doing, not in and of itself.