Also unrelated, but hopefully not flamebait: the code blocks run off my screen on an iPad because their lines are too wide. Having just fixed this issue on my own website yesterday, a bit of unsolicited advice: add “overflow: auto” to them and constrain them to the standard margins for your article.
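For what it's worth, a minimal sketch of that fix, assuming the code blocks are rendered as pre elements (the selector is a guess about the site's markup, not something I've checked):

```css
/* Assumption: code blocks are <pre> elements; adjust the selector to match the site. */
pre {
  max-width: 100%; /* keep the block inside the article column */
  overflow: auto;  /* scroll long lines instead of letting them spill past the margin */
}
```

With this, long lines scroll horizontally inside the block rather than widening the page.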
Why is Facebook getting into reinforcement learning? Are there applications of RL to improving social connections or extracting more data to sell to advertisers?
I'd imagine that a company that exists entirely to collect huge amounts of data might have some interest in useful ways to automatically process that data.