by an0malous
9 days ago
They’re the language part of the puzzle, which seems to require some basic world modeling, but they can’t make novel models unless there’s an example in the training data. I think engineering and mathematical thought requires spatial reasoning; when I model problems, I see them as 3D shapes. Like the economy is a series of tubes that money flows through and collects in buckets, programming state is little boxes that hold values, chemical interactions are like keys that fit into locks. I don’t think LLMs can build models like that, but because they have so much memorized and there usually isn’t a need for a novel model custom-fit to a problem, they can fake it by imitation.
But the only way to map the network in an LLM is experimentally. You have to prompt it and see how the coefficients fall in order to construct your most likely walk through the training data.
I think that LLMs can and do come up with novel things through exhaustion, just by applying the relationships among one set of entities to an entirely different set. An accumulation of earlier context pushes up the probability of the new entities being mentioned, so they can easily replace the entities that were more strongly associated with the nearby connective, relationship words.
As such, I think LLMs are good at generating metaphors, and a lot of innovation comes from asking "What if As worked like Bs?" Just go through all the As and Bs, toss the pairings that don't make any sense, and test the ones that seem like they might.