Hacker News new | ask | show | jobs
by MrQuincle 4764 days ago
Good point. There are so many benchmarks that can be performed. POMDP (Partial Observable Markov Decision Processes) literature has a lot of benchmarks (bandits, etc.). Reinforcement literature has many. There are many standard problems in nonlinear control theory solved, not only the inverted pendulum. It is common knowledge that an algorithm that performs well on task A, will be outperformed on task B. What are the tasks B in Hawkins book? In what do these specific type of recurrent networks excel, and in what do they sink as a brick?