Hacker News new | ask | show | jobs
by orasis 523 days ago
Thompson Sampling is trivial to implement, especially with binary rewards. ChatGPT can do it reliably from scratch.