|
|
|
|
|
by bonoboTP
840 days ago
|
|
I don't think this is a fundamental limitation. If the LLM is trained (through RLHF or something else) to go on a chain-of-internal-monologue (which may take arbitrarily long) to figure out what it will answer, then this kind of adaptive amount of compute can be achieved. |
|