by cr4zy
3443 days ago
We currently don't look at being shot at (although it's an interesting suggestion!). The current reward function is basically this: distance_reward - off_road_penalty - speeding_penalty - slow_penalty - discomfort_penalty
Plus, if you collide or drive against traffic, the episode terminates. https://github.com/openai/universe-windows-envs/blob/f5aad96...
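
To make the shape of that concrete, here's a rough Python sketch of the formula above. It is not the code from the linked repo: the `StepTerms` container, the `step_reward` function, and the `collided` / `against_traffic` flags are names made up for illustration, and how each term is actually measured or scaled isn't covered here.

```python
from dataclasses import dataclass


@dataclass
class StepTerms:
    """Hypothetical per-step inputs; names mirror the terms in the formula."""
    distance_reward: float
    off_road_penalty: float
    speeding_penalty: float
    slow_penalty: float
    discomfort_penalty: float
    collided: bool = False          # assumed flag for a collision this step
    against_traffic: bool = False   # assumed flag for driving against traffic


def step_reward(t: StepTerms) -> tuple[float, bool]:
    """Return (reward, done) for one step.

    Reward is the distance term minus the four penalties. The episode is
    flagged as done on a collision or when driving against traffic.
    Assumption: the reward for the terminating step is still computed the
    same way; the original comment does not say.
    """
    reward = (
        t.distance_reward
        - t.off_road_penalty
        - t.speeding_penalty
        - t.slow_penalty
        - t.discomfort_penalty
    )
    done = t.collided or t.against_traffic
    return reward, done
```

So a step with distance_reward 1.0 and two penalties of 0.25 each nets 0.5, and setting either flag ends the episode regardless of the reward value.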