Difference between episodes and timesteps in Stable Baselines 3

74 Views Asked by sculabob At 27 July 2025 at 23:54

It is somewhat unclear to be how SB3 differentiates between timesteps and episodes.

In the learn function you can only use the "total_timesteps" parameter, and for SB3 this is generally defined as the total number of timesteps the agent will interact with the environment during training. What is a bit weird to me is that during training you do get information about the mean reward and episode length, but I do not know how to figure out how many episodes occur in the simulation and what are the maximum number of timesteps allowed per episode.

Original Q&A

Difference between episodes and timesteps in Stable Baselines 3

There are 0 best solutions below

Related Questions in REINFORCEMENT-LEARNING

Related Questions in STABLE-BASELINES

Trending Questions

Popular # Hahtags

Popular Questions