|
|
|
|
|
by daveguy
3747 days ago
|
|
Raw pixels. And the score. The score was separate, or rather a signal representing increased score. Relevant quote from the paper: "The emulator’s internal state is not observed by the agent; instead it observes an image xt ∈ Rd from the emulator,
which is a vector of raw pixel values representing the current screen. In addition it receives a reward rt representing the change in game score." The paper:
http://arxiv.org/abs/1312.5602 |
|