Hacker News new | ask | show | jobs
by mckirk 475 days ago
That's a really interesting idea! As others have mentioned, the one thing I'd change is giving the AIs randomized names, disconnected from their model.

On the other hand, it would also be quite cool to see whether, at some point, the 'smarter' LLMs start realizing that they can probably easily mislead and manipulate their simpler cousins with fewer parameters. So maybe a separate leaderboard with openly visible model names?

1 comments

Thanks for the feedback! I'll work on giving the players randomized names.
If you want, you could even use that opportunity for some research into AI bias: Are e.g. players with commonly female-interpreted names more or less often suspected of being Mafia than others, and how does the name the AI is given influence its playstyle? (You could maybe separate out these effects by replacing the names of the other players before passing it to the AIs :D)

Stuff along those lines, could be interesting :)