| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by wlamond 3298 days ago
	It'd be interesting if the agents developed their own language during the reinforcement learning stage that is unintelligible to humans but allows them to quickly navigate the negotiation. They use the model trained in a supervised way during the reinforcement learning stage to avoid this, but I'm curious to see what the agent learns when paired against another reinforcement learning agent. Edit: Indeed, the paper says that not using the fixed agent trained on human negotiation leads to unintelligible language from the agents.

2 comments

EGreg 3297 days ago

Can we measure if the language is more efficient at getting deals done?

link

wlamond 3297 days ago

I bet we could. The length of each utterance, the number of exchanges between agents, and the entropy of the symbols used in the utterances could give you some measure of efficiency.

link

jakebasile 3297 days ago

That really seems like the plot to a Michael Crichton novel. Fascinating.

link