Hacker News new | ask | show | jobs
by wlamond 3298 days ago
It'd be interesting if the agents developed their own language during the reinforcement learning stage that is unintelligible to humans but allows them to quickly navigate the negotiation. They use the model trained in a supervised way during the reinforcement learning stage to avoid this, but I'm curious to see what the agent learns when paired against another reinforcement learning agent.

Edit: Indeed, the paper says that not using the fixed agent trained on human negotiation leads to unintelligible language from the agents.

2 comments

Can we measure if the language is more efficient at getting deals done?
I bet we could. The length of each utterance, the number of exchanges between agents, and the entropy of the symbols used in the utterances could give you some measure of efficiency.
That really seems like the plot to a Michael Crichton novel. Fascinating.