|
|
|
|
|
by westurner
144 days ago
|
|
Task: play tetris Task: write and optimize a tetris bot Task: write and safely online optimize a tetris bot with consideration for cost to converge openai/baselines (7 years ago) was leading on RL and then AlphaZero and Self-Attention Transformer networks. LLMs are trained with RL, but aren't general purpose game theoretic RL agents? |
|
"Outsmarting algorithms: A comparative battle between Reinforcement Learning and heuristics in Atari Tetris" (2025) https://dl.acm.org/doi/10.1016/j.eswa.2025.127251