Hacker News new | ask | show | jobs
by xwdv 1172 days ago
It’s not really. ChatGPT could already do all those things. This just presents it for a different use case.
2 comments

I think you're ignoring the work that went into this as well as the useful technology that came out. Prompts make or break interactions with LLMs and ChatGPT especially. The difference in output from a naive prompt and a well crafted one is huge. This paper is one of many explorations of what happens when you design such prompts to work in an iterative fashion building upon the previous conversation text to produce emergent behaviour. These are the seeds of the next programming paradigm on a completely novel architecture. It's incredibly exciting to be present for the beginning of this field, this is what mathematicians must have felt like when they helped design and program the first computers.
Oh yeah I agree that it _could_ do all those things, but it would be a bit of overkill to always send every observation an agent encounters into the API/chatbox, and ask it to spit out an evaluation or action.

This paper does a nice job of separating the "agency" from the next word with context type predictor. I think that's why I like the paper, it is just chatgpt, in the same way that pizza is just dough, sauce, and cheese.

Yes, but I think this was a fairly obvious conclusion to imagine isn’t it.

If you were going to seriously consider using ChatGPT for AI in a game, you would need each instance of GPT to only know certain information it has gathered. And you would want it to reflect on observations to come up with new thoughts that weren’t observed.

Still, I’d argue you don’t really even need GPT for any of the above. GPT is useful if you want thoughts expressed as natural language, but you could easily code observations and thoughts into an appropriate abstract data structure and still have the same thing, except it’s a bit harder to understand since asking an NPC something in a language it understands and getting back a query result isn’t user friendly, but it can be just as amazing if you know what the data represents. The imprecision and fuzziness of an LLM leaves room for fun weirdness though.