|
|
|
|
|
by RSchaeffer
2644 days ago
|
|
This is going to sound cynical, but I recently invested a week in rllib for a project before discovering that much of the under-the-hood implementation was horribly confusing, poorly documented and missing critical functionality (for instance, their IMPALA implementation only works with discrete action spaces). Does this library conceal similar problems? |
|
Regarding code quality and ease of use, we follow a strict modular approach with separate components that can be tested individually. Component dataflow is defined on an abstract level, which makes it rather easy to create new components and algorithms. So, instead of having to adjust complex code structures with lots of intertwined behavior, you usually can just plug in another component that covers your use case.