|
|
|
|
|
by ericye16
478 days ago
|
|
This was the basis of a project I did for my deep reinforcement learning class! https://ericye16.com/stanford-cs224r We were able to make some improvements by tuning how the reward is distributed and also by first pretraining the agent on scales before fine-tuning them on the final pieces. Thanks to Kevin Zakka for helping us get started with the RL environment! |
|