| Apologies. Your pushback (frustration and patience) has helped me crystalize my view, thank you. 1. Define understanding. My definition isn't vague: "a compact representation enabled because that representation's topology closely matches the topology of the relationships being modeling." Understanding = Scope and Suitability of Behavior / # Parameters. Useful property: This definition applies across all scales: Scientists and mathematicians increase our understanding, every time patchworks of relationships get replaced with a simpler underlying insight. Another useful property: It distinguishes between better understanding and having more facts. Facts improve performance but do not (non-trivially) decrease parameters. What is your definition? In measurable terms? 2. You keep avoiding a basic aspect of modeling: Higher compactness is achieved by higher representation correspondence between a model and the modeled. Yes, lower level representations can work. Even well, without good "understanding". But not as compactly. And as problem complexity grows, the relative difference in parameter budgets for high-correspondence and low-correspondence representations explode. This is not a subtle effect. The hallmark of lower-level fitting is the far greater number of parameters required. Dead simple example: Piece-wise linear vs. polynomial fitting of Bezier curves. Accuracy / parameter is far greater for the latter, because the representation matches the relationships being modeled. That is an intentionally trivial example, but the same relationship holds for any problem. You keep avoiding that. 3. Today's LLM models are very compact compared to humans. Compressing the substance of a corpus of global human writing into less than 1% of a single human's parameter space is compact. Humans have 100–200 trillion, some people think 500 trillion, synapses. How do you argue that behavior scope and suitability / parameters is not remarkable, when it is remarkable compared to any specific human you could point to? No human can converse reasonably across the scope of global communication. But these models can. For <1% of a human's parameter budget. 4. Finally, based on your clear definition, how do you argue that humans understand but models do not? Saying we are different is a copout. Defining understanding as us vs. other is both circular and unenlightening. And ignores the real progress models are clearly making relative to humans. |