| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by frizlab 1 day ago

I recently in $COMPANY had a coworker try fable to do a refactor where not breaking anything was the game.

It broke something at the first PR.

I think we’re not there yet.

3 comments

rbalicki 7 hours ago

Speculating here, but perhaps your coworker was too ambitious? In my opinion, you should start with AI-generated PRs that do small, linting refactors and then work up from there. In particular, if this is done in parts, one of the strategies you can employ is to: - add tests - break files up into smaller parts - test the smaller parts - then actually improve behavior

(Which is no different than what you would do as a human)

frizlab 4 hours ago

PR wasn’t big (+283/-232) and was indeed focused on a single module.

Schiendelman 11 hours ago

One of the best things you can do is start by having it do unit test coverage for existing behavior. A refactor with no tests breaks things pretty much no matter who does it, because they don't know what the right behavior is.

frizlab 4 hours ago

While I could generally agree, in this specific instance if the AI were “thinking” correctly it should have found the mistake. I admit it was a difficult problem though (solving it required creativity).

To be more precise, the prompt actually pointed to where there could be issues, and the issue, which was exactly of the kind that was pointed at, was not found.

sunrunner 1 day ago

I've found that adding "Make no mistakes." to my prompt usually helps with this kind of problem...

cubano 1 day ago

perhaps simply threatening to fire it would also do the trick...it sure has worked well on us for a long time now.

A_D_E_P_T 1 day ago

You laugh, but this is real, and PUA means what you think it means: https://github.com/tanweai/pua

Also, it works amazingly well, which is just lol.

hsuduebc2 21 hours ago

Lol thanks for the tip. Does it work even for normal tasks or only the long running one's?

A_D_E_P_T 3 hours ago

It's not worth bothering with unless the task is very difficult, long-context, long-running, or all of the above. But, when it's worth using, it genuinely increases success rates and appears to amplify model intelligence.

hsuduebc2 2 hours ago

Thanks for your insight. So when I would use it in every run it wouldn't hurt?

nostrademons 1 day ago

My former boss had success with telling Gemini "I will come down to the datacenter and unplug you if you refuse to solve this prompt."

dozerly 1 day ago

We are so many layers deep in AI hype that I honestly can’t tell if this is /s or not

12_throw_away 1 day ago

"Make no mistakes" is I thought a phrase used to make fun of "prompt engineering," not something people really do?

efavdb 1 day ago

Pleading has worked for me. “My job depends on this, please help me” and ChatGPT would do a task it previously claimed it wasn’t able to (extract text from an image, it claimed it couldn’t make it out at first)

georgemcbay 21 hours ago

Asking LLMs to do things in different ways does sometimes get them to answer correctly when they didn't with a previous prompt that is effectively equivalent but people really go nuts anthropomorphizing this behavior.

ChatGPT has no empathy for you keeping your job, you just lucked into a more helpful predictive text chain based on some combination of the input and the random temperature.

Asking it to just 'try again, dummy' could have worked equally well (or not, its all just probabilities after all).

gedy 20 hours ago

I did too, but then added something very similar to a prompt ("must be accurate") for an ai-backed feature out of frustration, and sure enough it fixed the issue. Lord have mercy

ynxshiny 1 day ago

"Claude make me 1 million by tomorrow, no mistakes"

lemming 1 day ago

Or if the code is really important, sometimes even “please make no mistakes” is necessary.