| HN Mirror



	by xg15 544 days ago
	Ah, that makes sense. So (very) basically, they're putting a number of regular LLMs into a sort of compute chain/graph, where one LLM feeds into the next, then doing gradient descent on the whole chain at once, essentially treating the boundaries between LLM n and LLM n+1 as "hidden layers"?

1 comment

Author here. Yeah, in this fashion. It can also generate the feedback by using an LLM as the backward engine.
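
For readers who want something concrete, here is a rough sketch of the idea discussed in this thread. It is not the project's actual code. Every name in it (`Node`, `toy_llm`, `toy_backward_engine`, `forward_chain`, `backward_chain`) is hypothetical, and the "LLMs" are deterministic stubs so it runs without any model or API. It also assumes the simplest possible graph, a straight chain, and it reuses one feedback string both as the node's "gradient" and as the signal passed upstream, which a real system would likely separate.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical type aliases: stand-ins for real model calls.
LLM = Callable[[str, str], str]                  # (prompt, input text) -> output text
BackwardEngine = Callable[[str, str, str], str]  # (prompt, output, downstream feedback) -> feedback


def toy_llm(prompt: str, text: str) -> str:
    """Deterministic stub for a forward LLM call (assumption: no real model)."""
    return f"[{prompt}] {text}"


def toy_backward_engine(prompt: str, output: str, downstream_feedback: str) -> str:
    """Stub for the 'backward engine'. A real one would ask an LLM to critique
    how this node's prompt and output led to the downstream feedback."""
    return f"revise '{prompt}' because: {downstream_feedback}"


@dataclass
class Node:
    prompt: str                                        # the tunable "parameter" of this stage
    feedback: List[str] = field(default_factory=list)  # accumulated textual "gradients"
    last_output: str = ""                              # cached for the backward pass

    def forward(self, text: str, llm: LLM) -> str:
        self.last_output = llm(self.prompt, text)
        return self.last_output


def forward_chain(nodes: List[Node], text: str, llm: LLM) -> str:
    """Feed the output of node n into node n+1, like layers in a network."""
    for node in nodes:
        text = node.forward(text, llm)
    return text


def backward_chain(nodes: List[Node], loss_feedback: str, engine: BackwardEngine) -> None:
    """Walk the chain in reverse, turning downstream feedback into feedback
    for each node, analogous to backpropagation through hidden layers."""
    feedback = loss_feedback
    for node in reversed(nodes):
        feedback = engine(node.prompt, node.last_output, feedback)
        node.feedback.append(feedback)


# Usage: two chained stages, one forward pass, one backward pass.
chain = [Node("summarize"), Node("translate")]
final_output = forward_chain(chain, "hello", toy_llm)
backward_chain(chain, "too long", toy_backward_engine)
for stage in chain:
    print(stage.prompt, "->", stage.feedback)
```

An optimizer step would then rewrite each `prompt` from its collected `feedback`, again presumably with an LLM. That part is left out here because the thread does not describe it.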