Show HN: Multi-GPU Reinforcement Learning in Tensorflow for OpenAI Gym

Y	Hacker News new \| ask \| show \| jobs

	Show HN: Multi-GPU Reinforcement Learning in Tensorflow for OpenAI Gym (github.com)
	58 points by seasonedschemer 3628 days ago

1 comments

rozguil 3628 days ago

A bit off topic, but how many people here use rl in their day job, and, if you use it, what do you use it for?

link

kmike84 3628 days ago

We're using it for web crawling: define what to look for (a reward function), and crawler can learn how to get these pages from the web without wasting too much HTTP requests for irrelevant content. No neural nets, just Q-Learning with linear function approximation, with some common tricks like double learning and experience replay.

link

pigscantfly 3628 days ago

I work on a few algorithms that could be classified as RL given an open mind. Most of them learn distributions from streaming data via some kind of online EM. I know that people in the ad-serving, porn-serving, and website optimization (A/B stuff) sectors use RL pretty extensively as well, but I'm not one of them at the moment.

link

Dzugaru 3628 days ago

> learn distributions from streaming data

That's unsupervised learning afaik - clustering, manifolds etc. Where is "reinforcement" part there (agent, environment, reward)?

link

aab0 3628 days ago

If you use RL, you might not know it. Multi-armed bandits, for example.

link

Dzugaru 3628 days ago

Used simple RL (finite states, no neural nets) for gamedev - finding optimal policies for player behavior helps in fixing balance issues.

link

huevosabio 3628 days ago

+1, I also am curious of how prevalent is RL in industry.

link

assface 3628 days ago

RL is huge in pornography.

link