HN Mirror



	by weisser 139 days ago
	What sort of context do you give the APIs when you are starting the game? Does it need to learn the rules as it goes?


mbh159 139 days ago

We have a standard harness for each of the models that we test. Each prompt includes the rules, access to memory, and a lookup of the complete ruleset. The prompt adapts every turn: it adds the legal actions for that turn and guidance that depends on the stage of the game (updated based on the player's technological progress).

Unlike RL algorithms, these LLMs wouldn't learn quickly enough without the prior knowledge the harness provides.
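
For readers who want something concrete, here is a rough sketch of how a per-turn prompt like the one described above could be assembled. This is not the actual harness: `TurnState`, `build_prompt`, the stage names, and all placeholder strings are assumptions made up for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class TurnState:
    # Hypothetical per-turn snapshot; field names are illustrative only.
    stage: str                      # assumed stage label, e.g. "early" or "late"
    legal_actions: List[str]        # actions the game engine allows this turn
    memory_notes: List[str] = field(default_factory=list)


# Placeholder text; a real harness would load the actual rules here.
RULES_SUMMARY = "<rules summary>"

# Placeholder guidance keyed by an assumed stage label.
STAGE_GUIDANCE = {
    "early": "<guidance for the early game>",
    "late": "<guidance for the late game>",
}


def build_prompt(state: TurnState) -> str:
    """Combine static rules with per-turn context into one prompt string."""
    memory = "\n".join(state.memory_notes) or "(empty)"
    guidance = STAGE_GUIDANCE.get(state.stage, "(no guidance for this stage)")
    actions = "\n".join(f"- {a}" for a in state.legal_actions)
    sections = [
        "RULES:\n" + RULES_SUMMARY,
        "MEMORY:\n" + memory,
        "STAGE GUIDANCE:\n" + guidance,
        "LEGAL ACTIONS:\n" + actions,
    ]
    return "\n\n".join(sections)
```

Calling `build_prompt(TurnState(stage="early", legal_actions=["end_turn"]))` returns a single string with the four sections in a fixed order, so only the memory, guidance, and legal-action parts change from turn to turn.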


weisser 139 days ago

what do you use for memory?


mbh159 139 days ago

Tool call over Redis for now. It would be cool to experiment with different context/memory management systems for the agents, though!
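
A minimal sketch of what a memory tool call backed by a key-value store might look like. The tool schema (`op`, `key`, `value`), the key prefix, and `handle_memory_tool` are all assumptions, not the setup described in the comment. `InMemoryStore` is a stand-in so the example runs without a server; it mirrors only the `get`/`set` methods of a redis-py client, so a `redis.Redis(...)` instance could presumably be passed in its place.

```python
import json


class InMemoryStore:
    """Stand-in exposing the same get/set shape as a redis-py client."""

    def __init__(self):
        self._data = {}

    def set(self, name, value):
        self._data[name] = value
        return True

    def get(self, name):
        return self._data.get(name)


def handle_memory_tool(store, agent_id: str, call: dict) -> str:
    """Execute one hypothetical memory tool call and return a JSON result."""
    # Namespace keys per agent so agents cannot overwrite each other's notes.
    key = f"agent:{agent_id}:{call['key']}"
    if call["op"] == "write":
        store.set(key, call["value"])
        return json.dumps({"ok": True})
    if call["op"] == "read":
        raw = store.get(key)
        # A real Redis client returns bytes unless decode_responses=True.
        if isinstance(raw, bytes):
            raw = raw.decode("utf-8")
        return json.dumps({"ok": True, "value": raw})
    return json.dumps({"ok": False, "error": "unknown op"})
```

The model would emit a call such as `{"op": "write", "key": "plan", "value": "..."}`, the harness would run it through `handle_memory_tool`, and the JSON string would go back to the model as the tool result.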
