|
|
|
|
|
by MrQuincle
4764 days ago
|
|
Good point. There are so many benchmarks that can be performed. POMDP (Partial Observable Markov Decision Processes) literature has a lot of benchmarks (bandits, etc.). Reinforcement literature has many. There are many standard problems in nonlinear control theory solved, not only the inverted pendulum. It is common knowledge that an algorithm that performs well on task A, will be outperformed on task B. What are the tasks B in Hawkins book? In what do these specific type of recurrent networks excel, and in what do they sink as a brick? |
|