Hacker News new | ask | show | jobs
by jonathan_landy 531 days ago
Seems to depends strongly on the model perhaps. The Reddit post says

“Some other models tested that just didn't work: gpt-4o, gpt-o1, qwen qwq.”

Notably gpt-4o was used in the post linked here.

1 comments

I don't know what they were doing, but I tried o1 with many problems after I solved them already and it did great. No special prompting, just "solve this problem with a python program".