therealagni commited on
Commit
b616d57
1 Parent(s): 573fd74

Upload PPO MontezumaRevenge-v5 trained agent 1M timesteps, CNN 0.01 LR

Browse files
config.json CHANGED
The diff for this file is too large to render. See raw diff
 
ppo-MontezumaRevenge.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bee4b1a9f1a0d5320eaca2ac8f90a0da3181922c12368a5eac79ea824e8c4918
3
- size 142165271
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9ec2fb8c4b5ad2795ec2e44c70b4efcbd4a02b7151b3fa0d673c4b2c13bc9fc1
3
+ size 142167374
ppo-MontezumaRevenge/data CHANGED
The diff for this file is too large to render. See raw diff
 
ppo-MontezumaRevenge/policy.optimizer.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1f9fea24e7ad738095969017872790ad66c22598ca5c33f34b3423be57f86c58
3
  size 92973881
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8735a1a3877d4ae8afe5962b0918a97d418693a0ca4aebe057b87072c901eb87
3
  size 92973881
ppo-MontezumaRevenge/policy.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bb730158cdfd9f7f4ed27c0b3dc85382c5f688f512c0925bd6095b4ebca06903
3
  size 46486273
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:77e53672489b6743077b89799c3475a363aeed0c6b1c3e1a5c4d5c27c72875d6
3
  size 46486273
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1 +1 @@
1
- {"mean_reward": 0.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2022-12-27T00:18:43.659596"}
 
1
+ {"mean_reward": 0.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2022-12-27T01:08:06.608569"}