|
|
|
|
|
by Dwolb
1093 days ago
|
|
The weird thing is that training on text encodes information on related concepts. from u/gmt2027 >An extreme version of the same idea is the difference between understanding
DNA vs the genome of every individual organism that has lived on earth. The species record encodes a ton of information about the laws of nature, the composition and history of our planet. You could deduce physical laws and constants from looking at this information, wars and natural disasters, economic performance, historical natural boundaries, the industrial revolution and a lot more. and u/thomastjeffery >That entropy is the secret sauce: the extra data that LLMs are sometimes able to model. We don't see it, because we read language, not text. |
|