Hacker News new | ask | show | jobs
by kaushikbokka 264 days ago
The future of RL observability could look like this:

you’re working alongside your model, spawning multiple versions of your environment by tweaking components at different points, much like using git worktrees for RL experiments.