> It's still just looking up function to do x, function to do y and applying it to the output.

I mean no, no it isn't. I'm giving it info on how to construct data models with a custom library, so interacting with that is not using anything previously stored, and then giving it businesses/tasks to model as simple human descriptions.

If you tell me that something which

* Takes a human description of a problem
* Describes back to me the overall structure and components required to solve it with a hierarchy
* Converts that into code, correctly identifying where it makes sense for an address to be contained within a model or distinct and referenced
* Correctly reuses previously created classes that are obviously not in its original dataset

has no understanding or reasoning and is just regurgitating things it's seen before, simply mashed together, I don't know what to say. Frankly,

> it's taking several based on probability and merging them into what it thinks we're looking for.

sounds pretty much like understanding and reasoning to me.

> but there's no need to exaggerate it as magic.

I'm absolutely not saying there's magic. Humans aren't magic and they can do reasoning. I'm saying it's not just looking up text and regurgitating it. I think this is supported by things like Othello-GPT, which builds an internal world model and outputs based on that.
It's impressive that it knows the difference between "how many are 5 more apples than 10" and "how many percent are 5 apples of 10" (I don't know if it does, just assuming). But the first release also tried to reason about what the weight of 1 pound of nails depends on when given the simple prompt "how much do 1 pound of nails weigh". That's most likely a perfect example of it mashing in the classic "what weighs more, 1 pound of nails or 1 pound of feathers".
It IS just looking in a database and mashing it with some fluff. I'm happy to be proven wrong, but I need more than your word for it. My experience is that as the topic gets more niche (less data in the training set), the answers I get become worse and it starts making things up based on probability. It doesn't reason in the sense I assume you're expecting.