| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pigscantfly 3628 days ago
	I work on a few algorithms that could be classified as RL given an open mind. Most of them learn distributions from streaming data via some kind of online EM. I know that people in the ad-serving, porn-serving, and website optimization (A/B stuff) sectors use RL pretty extensively as well, but I'm not one of them at the moment.

1 comments

> learn distributions from streaming data

That's unsupervised learning afaik - clustering, manifolds etc. Where is "reinforcement" part there (agent, environment, reward)?