Hacker News new | ask | show | jobs
by bko 343 days ago
I love these work-arounds and generous tiers. A bit of a tangent, but with very cheap essentially unlimited code generation, are there any active projects that just run this for days straight with an ambitious goal like "Develop an operating system" with instructions to just make all the necessary decisions to continue work?

I would love to see what a system like Claude Code could cook up running continuously for weeks. But I imagine it would get stuck in some infinite recursive loop.

5 comments

Yes I tried pushing it as far as possible over the course of a couple days to invent, build and prioritize the direction of a new programming language (trying to give it as much freedom as possible and make its own decicions while steering it only to not get stuck). After around $50 in tokens it kinda got lost in the complexity it had created and just kept adding more and more useless trivialities while overlooking fundamental unsolved problems.

E.g. it wanted to build a data query language with temporal operations but completely forgot to keep historical data.

It currently lacks the ability to focus on the overall goal and prioritize sub-tasks accordingly and instead spirals into random side quests real quick.

I think you might be aiming too low. Tasked with writing a "perfect and most useful program" this would surely yield something more than merely writing 42 to stdout.
There is ClaudePlaysPokemon which has been failing to beat the game for weeks (months?) now.

edit:

https://www.twitch.tv/claudeplayspokemon

Current llms get lost fairly quickly in larger projects. They still benefit from reduced scope when promoting. Context is the biggest bottleneck right now by far. You can only summarize so much before the information is too vague to make meaningful changes.
It would probably look suspiciously like Linux.
Better that than violating someone’s IP!