ppo-LunarLander-v2 / results.json
alvarobb
Pushing v1 of agent trained with PPO in LunarLander
423efc3
163 Bytes
{"mean_reward": 243.5892510662182, "std_reward": 21.21631109384525, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-28T17:38:20.458424"}