| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by charlierguo 609 days ago

It's fascinating/spooky how different LLMs are slowly developing their own "personalities," so to speak. And they seem to be emerging as we're giving them access to more tools and modalities which are harder to do broad RLHF on.

With computer use, we first learned that Claude sometimes takes breaks to browse pictures of Yosemite, and now this:

> Claude really likes Firefox. It will use other browsers if it absolutely has to, but will behave so much better if you just install Firefox and let it go to its happy place.

5 comments

abixb 609 days ago

>Claude really likes Firefox.

I don't mind being reigned over by AI overlords that'll choose FOSS over proprietary.

link

photonthug 609 days ago

>> > Claude really likes Firefox. It will use other browsers if it absolutely has to, but will behave so much better if you just install Firefox and let it go to its happy place.

It's hard to ignore the glimpse into the future of engineering that we're seeing here. Deterministic processes are out the door, no specs, no tolerances, no design. When did undefined behaviour become a cute thing that we're bragging about and compensating for, something to work around rather than something to understand and to fix?

It's not a big deal until you realize that software always gets stacked on software, and the only thing that ever made that complexity manageable was the fundamental assumption that it was all pretty deterministic. Of course users will sacrifice the strategic (good engineering) for the tactical (mere convenience) all day long, but the fact that so many engineers are all-in on the same short-sighted POV has been surprising to me.

link

danudey 609 days ago

> we first learned that Claude sometimes takes breaks to browse pictures of Yosemite

We learned what now?

link

abixb 609 days ago

For those lacking context: https://x.com/anthropicai/status/1848742761278611504

From the Anthropic tweet (X post?):

"Even while recording these demos, we encountered some amusing moments. In one, Claude accidentally stopped a long-running screen recording, causing all footage to be lost.

Later, Claude took a break from our coding demo and began to peruse photos of Yellowstone National Park."

link

danudey 609 days ago

SkyNet with ADHD, great.

link

fullstackchris 609 days ago

I dont know about you, but sounds like every lazy developer I know... this must be proof of AGI! :D

link

m463 609 days ago

step 2: make posts to hacker news with source code link, causing reproduction of Agent.exe, possibly with mutations via forking

link

tomjen3 609 days ago

I mean if the goal is to humanize and make AIs more relatable, then fine.

If it had stopped the coding task to browse hackernews, I would have to start to march for AI rights.

link