|
|
|
|
|
by laughingman2
1248 days ago
|
|
Have you checked RSSM approach in DreamerV1,V2,V3,PlaNet? It uses deterministic (GRU hidden state) and discrete stochastic latent states. The deterministic and stochastic (sampled) latent state are used to predict the next state. I think the stochastic state might help with your problem a bit. |
|
Much appreciated. Have a great weekend :)