qgallouedec (HF staff) committed on
Commit 61f745e (1 parent: 677c690)

pushing model

README.md CHANGED
@@ -16,7 +16,7 @@ model-index:
  type: Hopper-v4
  metrics:
  - type: mean_reward
- value: 12.10 +/- 3.56
+ value: 21.51 +/- 36.78
  name: mean_reward
  verified: false
  ---
@@ -46,7 +46,7 @@ curl -OL https://huggingface.co/qgallouedec/Hopper-v4-ppo_continuous_action-seed
  curl -OL https://huggingface.co/qgallouedec/Hopper-v4-ppo_continuous_action-seed1/raw/main/pyproject.toml
  curl -OL https://huggingface.co/qgallouedec/Hopper-v4-ppo_continuous_action-seed1/raw/main/poetry.lock
  poetry install --all-extras
- python ppo_continuous_action.py --env-id Hopper-v4 --capture-video --save-model --upload-model --hf-entity qgallouedec --total-timesteps 500000
+ python ppo_continuous_action.py --env-id Hopper-v4 --capture-video --save-model --upload-model --hf-entity qgallouedec --total-timesteps 50000
  ```
 
  # Hyperparameters
@@ -74,7 +74,7 @@ python ppo_continuous_action.py --env-id Hopper-v4 --capture-video --save-model
  'seed': 1,
  'target_kl': None,
  'torch_deterministic': True,
- 'total_timesteps': 500000,
+ 'total_timesteps': 50000,
  'track': False,
  'update_epochs': 10,
  'upload_model': True,
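
The `mean_reward` value in the model card has the form `mean +/- std`. The commit does not show how that string is produced, so the following is only a minimal sketch of one plausible way to build it from a list of evaluation returns. The function name `format_mean_reward`, the use of the population standard deviation, and the sample returns are all assumptions for illustration, not values or code from this repository.

```python
from statistics import fmean, pstdev


def format_mean_reward(episodic_returns):
    """Format evaluation returns as 'mean +/- std' with two decimals."""
    if not episodic_returns:
        raise ValueError("need at least one episodic return")
    mean = fmean(episodic_returns)
    # Assumption: population standard deviation (divide by N, not N - 1).
    std = pstdev(episodic_returns)
    return f"{mean:.2f} +/- {std:.2f}"


# Hypothetical returns, not the ones from this run.
example_returns = [10.0, 20.0, 30.0]
summary = format_mean_reward(example_returns)
print(summary)
```

Read this way, the new value `21.51 +/- 36.78` has a spread larger than its mean, which would suggest that returns varied widely between evaluation episodes.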
events.out.tfevents.1712438759.MacBook-Pro-de-Quentin.local.21044.0 → events.out.tfevents.1712439170.MacBook-Pro-de-Quentin.local.21719.0 RENAMED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5418244407f4666bdc208811af1467ea91dc1088ee4494e966f7136197a84e2f
- size 364877
+ oid sha256:d5b8f6b2f594c04ecbf3ad3744014d4b9e1d6c0a4f048a44644d0a671e30f416
+ size 100294
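
The TensorBoard event file is stored through Git LFS, so the diff above only shows its pointer file: a `version` line, an `oid` line with the hash algorithm and digest, and a `size` line in bytes. As a rough sketch, such a pointer could be read like this. The helper name `parse_lfs_pointer` is hypothetical, and the parser assumes exactly the three `key value` lines seen in this diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dictionary."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer content taken from the new side of the diff above.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:d5b8f6b2f594c04ecbf3ad3744014d4b9e1d6c0a4f048a44644d0a671e30f416
size 100294
"""
pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```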
ppo_continuous_action.cleanrl_model CHANGED
Binary files a/ppo_continuous_action.cleanrl_model and b/ppo_continuous_action.cleanrl_model differ
 
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
videos/Hopper-v4__ppo_continuous_action__1__1712438759-eval/rl-video-episode-0.mp4 DELETED
Binary file (42.1 kB)
 
videos/Hopper-v4__ppo_continuous_action__1__1712438759-eval/rl-video-episode-1.mp4 DELETED
Binary file (52.9 kB)
 
videos/Hopper-v4__ppo_continuous_action__1__1712438759-eval/rl-video-episode-8.mp4 DELETED
Binary file (40.4 kB)
 
videos/Hopper-v4__ppo_continuous_action__1__1712439170-eval/rl-video-episode-0.mp4 ADDED
Binary file (159 kB)
 
videos/Hopper-v4__ppo_continuous_action__1__1712439170-eval/rl-video-episode-1.mp4 ADDED
Binary file (64.4 kB)
 
videos/Hopper-v4__ppo_continuous_action__1__1712439170-eval/rl-video-episode-8.mp4 ADDED
Binary file (45.3 kB)