| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by koakuma-chan 113 days ago
	This seems really weird to me. Isn't that just using LLMs in a specific way? Why come up with a new name "RLM" instead of saying "LLM"? Nothing changes about the model.

2 comments

dbreunig 109 days ago

"Think step by step," was just a sentence you appended to your prompt.

It ended up kicking off reasoning training which enabled the massive gains in coding, tool use, and more over the last 18 months.

So yeah, it's "just using LLMs in a specific way."

link

vimda 113 days ago

RLMs are a new architecture, but you can mimic an RLM by providing the context through a tool, yes

link

anonymousd3vil 113 days ago

New architecture to building agent, but not the model itself. You still have LLMs, but you kinda give this new agentic loop with a REPL environment where the LLM can try to solve the problem more programmatically.

link