| HN Mirror

I just asked chatgpt-4o and the answers were perfectly logical although not creative at the level of a creative human (but many humans are not that creative either.

For example one of the outputs:

"Host an event where statistical mechanics concepts are explained or demonstrated while making pizzas, all set to a backdrop of live techno music. The music could be dynamically generated based on real-time data from the pizza-making process, perhaps using sensors to monitor heat, time, or the distribution of toppings, with this data influencing the techno tracks played."

It not doing such a bad job trying to mix up three unrelated concepts. It knows music is not an ingredient for the pizza and knows that pizza requires heat for cooking and that heat is explained with statistical mechanics.

Sure you can nitpick and find nuances that are wrong but honestly an average human asked to come up with something for a school assignment would probably not do a much better job.

Now, there are clearly better examples of utter failures where even the best model trip on that reveals that they are not even close at understanding and modeling the world correctly.

My point is just that their weakness cannot merely explained by the next token prediction process.