|
|
|
|
|
by dinfinity
389 days ago
|
|
That section is not really what the paper is about at all, though. The examples they give of (what they think is) FER in LLMs (GPT-3 and GPT-4o) are most informative to a layman and most representative of what is said to be the core issue, I'd say. For instance: User:
I have 3 pencils, 2 pens, and 4 erasers. How many things do I have? GPT-3:
You have 9 things. [correct in 3 out of 3 trials] User:
I have 3 chickens, 2 ducks, and 4 geese. How many things do I have? GPT-3:
You have 10 animals total. [incorrect in 3 out of 3 trials] |
|
It's hard to say that there is no internal organization since trillion parameter models are hard for us to summarize and we do see some semantic vector alignment in the GPT models but the toy example of the 2 skull image generator present a powerful anecdote of how current ML models find correct solutions but miss a potentially valuable property of having what the paper calls factored representation which seems to be the far more "human" way to reason about data.