by spawkfish
3755 days ago
Training different policies in different styles is a really interesting idea. You could then have a gating process that first chooses the "style" of move to make and then uses the style-specific network to select the move. I think getting data for this could be difficult, though. I wonder how easy it would be to automatically categorize a game record by "style"?
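To make the gating idea concrete, here is a rough sketch in plain Python. Everything in it is an assumption for illustration: the sizes, the names (`gate_w`, `style_w`, `select_move`), and the random weights standing in for trained networks. It only shows the two-step selection, not how you would train any of it.

```python
import math
import random

random.seed(0)

# All sizes are hypothetical placeholders.
N_STYLES, N_FEATURES, N_MOVES = 3, 8, 5

def random_matrix(rows, cols):
    """Random weights standing in for a trained network."""
    return [[random.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]

gate_w = random_matrix(N_FEATURES, N_STYLES)                             # gate: features -> style scores
style_w = [random_matrix(N_FEATURES, N_MOVES) for _ in range(N_STYLES)]  # one policy per style

def linear(x, w):
    """Multiply feature vector x by weight matrix w (rows index features)."""
    return [sum(xi * row[j] for xi, row in zip(x, w)) for j in range(len(w[0]))]

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def select_move(features):
    """Step 1: the gate picks a style. Step 2: that style's policy picks a move."""
    style_probs = softmax(linear(features, gate_w))
    style = max(range(N_STYLES), key=style_probs.__getitem__)
    move_probs = softmax(linear(features, style_w[style]))
    move = max(range(N_MOVES), key=move_probs.__getitem__)
    return style, move, move_probs

features = [random.gauss(0, 1) for _ in range(N_FEATURES)]
style, move, move_probs = select_move(features)
print("style:", style, "move:", move)
```

Picking the single most likely style is the simplest choice; you could also weight every style's move distribution by the gate's probabilities instead.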
If it works, you would be able to perform player vector math à la word2vec. (No idea if it will work.)
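By player vector math I mean something like the toy example below. The embeddings are made up by hand (first coordinate roughly "strength", second roughly "aggression"), so this only shows the arithmetic; whether embeddings learned from real game records would behave this way is exactly the open question. The player names and helper functions are hypothetical.

```python
import math

# Hand-made 3-d "player vectors"; real ones would have to be learned.
players = {
    "aggressive_pro":     [0.9, 0.8, 0.1],
    "calm_pro":           [0.9, 0.1, 0.1],
    "balanced_pro":       [0.9, 0.45, 0.1],
    "aggressive_amateur": [0.2, 0.8, 0.1],
    "calm_amateur":       [0.2, 0.1, 0.1],
}

def add(a, b):
    return [x + y for x, y in zip(a, b)]

def sub(a, b):
    return [x - y for x, y in zip(a, b)]

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def nearest(query, exclude):
    """Return the player whose vector is most similar to the query vector."""
    candidates = (name for name in players if name not in exclude)
    return max(candidates, key=lambda name: cosine(query, players[name]))

# aggressive_pro - calm_pro + calm_amateur: "add aggression to the calm amateur"
inputs = {"aggressive_pro", "calm_pro", "calm_amateur"}
query = add(sub(players["aggressive_pro"], players["calm_pro"]), players["calm_amateur"])
result = nearest(query, exclude=inputs)
print(result)
```

Excluding the input players from the search mirrors how word2vec analogies are usually evaluated, since the query often lands closest to one of its own inputs.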