| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jmoss20 1195 days ago
	Which makes you wonder what would happen if you gave ChatGPT two streams of output: one for "speech" and one for inner-monologue-style "thinking". For most of our "hmm, let me think"'s I'm not sure what we do is significantly more complicated than that. We just get to hide the inner monologue.

2 comments

Jensson 1195 days ago

Letting it decide how much time computing an answer instead of making it spend the same amount of time per text token could definitely help a lot. Right now it spends the same amount of energy processing each token, but that doesn't make sense, "hello, how are you" should take basically no processing while hard logical problems should take a lot.

Maybe these language models could be way smaller and cheaper if we added a recurse symbol to it that made it iterate many times. Hard tasks would still take a lot, but most banal conversations would be very cheap.

link

jmoss20 1193 days ago

This does indeed improve efficiency significantly: https://arxiv.org/abs/2207.07061

link

flangola7 1195 days ago

>one for inner-monologue-style "thinking".

I'm not sure I know what you're talking about.

link

taneq 1195 days ago

And here we go on the aphantasia / does everyone have an inner monologue / can you make your ears rumble / do you sit or stand when pooping train. :P

link

zoklet-enjoyer 1195 days ago