ppo-MontezumaRevenge-v5-1/ppo-MontezumaRevenge/_stable_baselines3_version
therealagni
Upload PPO MontezumaRevenge-v5 trained agent 1M timesteps, CNN 0.01 LR nsteps 4096
6fb822a
1.6.2