|
|
|
|
|
by tasty_freeze
1640 days ago
|
|
> all the correct answers That is clearly not possible, so it can't be what they are doing. Rather than diffusely encoding that knowledge in a massive number of self-organized layers of weights, it is explicitly encoded. The remaining network can "focus" on mapping input to retrieve the relevant information stored in that database, and extracting/interpolating/extrapolating that information based on the current context to generate useful output. |
|