by cr4zy 3443 days ago
We currently don't look at being shot at (although it's an interesting suggestion!) - the current reward function is basically this:

  distance_reward - off_road_penalty - speeding_penalty - slow_penalty - discomfort_penalty;
Plus, if you collide or drive against traffic, the episode terminates.
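
For anyone who wants to play with the shape of this, here's a rough Python sketch of the formula plus the termination rule. It is not the actual code from the repo: `StepInfo`, `step_reward`, and the `collided` / `against_traffic` flags are made-up names, and the individual terms are assumed to be already-computed non-negative numbers. Whether the terminating step still gets a reward isn't stated above, so returning it here is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class StepInfo:
    """Per-step quantities (hypothetical container, not from the real env)."""
    distance_reward: float
    off_road_penalty: float
    speeding_penalty: float
    slow_penalty: float
    discomfort_penalty: float
    collided: bool = False          # assumed flag: the car hit something
    against_traffic: bool = False   # assumed flag: driving the wrong way


def step_reward(info: StepInfo) -> tuple[float, bool]:
    """Return (reward, done) for a single step."""
    # Progress term minus each of the four penalties, as in the formula above.
    reward = (
        info.distance_reward
        - info.off_road_penalty
        - info.speeding_penalty
        - info.slow_penalty
        - info.discomfort_penalty
    )
    # A collision or driving against traffic ends the episode.
    done = info.collided or info.against_traffic
    return reward, done
```

Calling `step_reward(StepInfo(1.0, 0.25, 0.0, 0.0, 0.25))` gives a reward of 0.5 with the episode still running; setting `collided=True` flips `done` without changing the reward.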

https://github.com/openai/universe-windows-envs/blob/f5aad96...