|
|
|
|
|
by HarHarVeryFunny
1214 days ago
|
|
I wonder if you could comment on this (related to question of how far ahead these "LLM"s are planning). This is Wharton professor Ethan Mollick playing with the new Bing chat, which seems considerably more advanced than ChatGPT (based on GPT-4 perhaps?). Here he asks it to write something using Kurt Vonnegut's rules of writing. https://twitter.com/emollick/status/1626084142239649792 It seems hard to explain how Bing/GPT could have generated the Vonnegut-inspired cake story, having ingested the rules, without planning the whole thing before generating the first word. It seems there's an awful lot more going on internally in these models than a mere word by word autoregressive generation. It seems the prompt (in this case including Vonnegut's rules) is ingested and creates a complex internal state that is then responsible for the coherency and content of the output. The fact that it necessarily has to generate the output one word at a time seems to be a bit misleading in terms of understanding when the actual "output prediction" takes place. |
|