by jmoss20 1195 days ago
Which makes you wonder what would happen if you gave ChatGPT two streams of output: one for "speech" and one for inner-monologue-style "thinking".

For most of our "hmm, let me think"'s I'm not sure what we do is significantly more complicated than that. We just get to hide the inner monologue.

2 comments

Letting it decide how much time to spend computing an answer, instead of making it spend the same amount of time per text token, could definitely help a lot. Right now it spends the same amount of energy processing each token, but that doesn't make sense: "hello, how are you" should take basically no processing, while hard logical problems should take a lot.

Maybe these language models could be way smaller and cheaper if we gave them a "recurse" symbol that made the model iterate many times. Hard tasks would still take a lot of compute, but most banal conversations would be very cheap.
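
To make the idea concrete, here is a toy sketch in Python. It is not how any real model works: `sharpen` is a made-up stand-in for "one more pass through a small shared block", and the 0.9 confidence threshold is an arbitrary assumption. It only shows the control flow, which is to keep iterating until the top prediction is confident enough or a step budget runs out.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a plain list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def adaptive_steps(initial_logits, refine, threshold=0.9, max_steps=8):
    """Call `refine` until the top probability reaches `threshold`
    or `max_steps` is hit. Returns (argmax index, refine steps used)."""
    logits = list(initial_logits)
    steps = 0
    while max(softmax(logits)) < threshold and steps < max_steps:
        logits = refine(logits)
        steps += 1
    probs = softmax(logits)
    return probs.index(max(probs)), steps

def sharpen(logits):
    # Hypothetical "recurse" step: a real model would rerun its layers here.
    # Doubling the logits just makes the distribution more peaked.
    return [2.0 * x for x in logits]

easy = [6.0, 0.0, 0.0]   # already confident, like "hello, how are you"
hard = [0.3, 0.2, 0.0]   # ambiguous, needs more iterations

print(adaptive_steps(easy, sharpen))
print(adaptive_steps(hard, sharpen))
```

The easy input exits immediately with zero extra steps, while the ambiguous one loops several times before it clears the threshold, so the cost scales with difficulty rather than being fixed per token.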

This does indeed improve efficiency significantly: https://arxiv.org/abs/2207.07061

> one for inner-monologue-style "thinking".

I'm not sure I know what you're talking about.

And here we go on the aphantasia / does everyone have an inner monologue / can you make your ears rumble / do you sit or stand when pooping train. :P

Make your ears rumble? Stand while pooping? What???