vwxyzjn committed on
Commit
677f602
1 Parent(s): 03c7e29

pushing model

README.md ADDED
@@ -0,0 +1,64 @@
+ ---
+ tags:
+ - CartPole-v1
+ - deep-reinforcement-learning
+ - reinforcement-learning
+ - custom-implementation
+ library_name: cleanrl
+ model-index:
+ - name: DQN
+   results:
+   - task:
+       type: reinforcement-learning
+       name: reinforcement-learning
+     dataset:
+       name: CartPole-v1
+       type: CartPole-v1
+     metrics:
+     - type: mean_reward
+       value: 36.50 +/- 11.32
+       name: mean_reward
+       verified: false
+ ---
+
+ # (CleanRL) **DQN** Agent Playing **CartPole-v1**
+
+ This is a trained model of a DQN agent playing CartPole-v1.
+ The model was trained by using [CleanRL](https://github.com/vwxyzjn/cleanrl) and the most up-to-date training code can be
+ found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/dqn_jax.py).
+
+ ## Command to reproduce the training
+
+ ```bash
+ curl -OL https://huggingface.co/vwxyzjn/CartPole-v1-dqn_jax-seed1/raw/main/dqn.py
+ curl -OL https://huggingface.co/vwxyzjn/CartPole-v1-dqn_jax-seed1/raw/main/pyproject.toml
+ curl -OL https://huggingface.co/vwxyzjn/CartPole-v1-dqn_jax-seed1/raw/main/poetry.lock
+ poetry install --all-extras
+ python dqn_jax.py --save-model --upload-model --hf-entity vwxyzjn --total-timesteps 1000
+ ```
40
+ # Hyperparameters
41
+ ```python
42
+ {'batch_size': 128,
43
+ 'buffer_size': 10000,
44
+ 'capture_video': False,
45
+ 'end_e': 0.05,
46
+ 'env_id': 'CartPole-v1',
47
+ 'exp_name': 'dqn_jax',
48
+ 'exploration_fraction': 0.5,
49
+ 'gamma': 0.99,
50
+ 'hf_entity': 'vwxyzjn',
51
+ 'learning_rate': 0.00025,
52
+ 'learning_starts': 10000,
53
+ 'save_model': True,
54
+ 'seed': 1,
55
+ 'start_e': 1,
56
+ 'target_network_frequency': 500,
57
+ 'total_timesteps': 1000,
58
+ 'track': False,
59
+ 'train_frequency': 10,
60
+ 'upload_model': True,
61
+ 'wandb_entity': None,
62
+ 'wandb_project_name': 'cleanRL'}
63
+ ```
64
+
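
The exploration settings in the hyperparameters above (`start_e`, `end_e`, `exploration_fraction`, `total_timesteps`) describe an epsilon-greedy schedule. The following is a minimal sketch of how those values could combine, assuming a linear decay that is clamped at `end_e`. The helper name `linear_schedule`, and the assumption that gradient updates begin only after `learning_starts` environment steps, are illustrative and are not confirmed by this commit.

```python
def linear_schedule(start_e: float, end_e: float, duration: int, t: int) -> float:
    """Decay epsilon linearly from start_e to end_e over `duration` steps, then hold."""
    slope = (end_e - start_e) / duration
    return max(slope * t + start_e, end_e)


# Values copied from the hyperparameter dump in the README diff.
hparams = {
    "start_e": 1,
    "end_e": 0.05,
    "exploration_fraction": 0.5,
    "total_timesteps": 1000,
    "learning_starts": 10000,
}

# Number of steps over which epsilon decays (assumed: fraction of the total run).
decay_steps = int(hparams["exploration_fraction"] * hparams["total_timesteps"])

for step in (0, 250, 500, 999):
    eps = linear_schedule(hparams["start_e"], hparams["end_e"], decay_steps, step)
    print(f"step={step:4d} epsilon={eps:.3f}")

# Assumption: the agent only starts updating once the step count exceeds learning_starts.
updates_possible = hparams["total_timesteps"] > hparams["learning_starts"]
print("updates possible within this run:", updates_possible)
```

Under these assumptions, a 1000-step run with `learning_starts` set to 10000 would end before any update could happen, so the reported `mean_reward` may reflect a short test run rather than a converged agent.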
dqn_jax.cleanrl_model ADDED
Binary file (43.9 kB).
 
events.out.tfevents.1671208305.pop-os.1580220.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:fb054c09e3c7a920037cf9695dbb707fdc85ce5da6bcb1195825bcf793fb1da5
+ size 5677
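
The three added lines are a Git LFS pointer rather than the TensorBoard event data itself: each line is a `key value` pair giving the spec version, the object ID (hash algorithm and digest), and the size in bytes. Below is a minimal sketch of reading such a pointer. The helper name `parse_lfs_pointer` is hypothetical and the parser handles only the three keys shown here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer into a dictionary."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the diff, without the leading "+" markers.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:fb054c09e3c7a920037cf9695dbb707fdc85ce5da6bcb1195825bcf793fb1da5
size 5677
"""

info = parse_lfs_pointer(pointer_text)
print(info["hash_algo"], info["size"])
```

The `size` value is the byte count of the real event file stored in LFS, which is why the diff shows only three lines for it.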
replay.mp4 ADDED
Binary file (5.85 kB).
 
videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-0.mp4 ADDED
Binary file (4.04 kB).
 
videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-1.mp4 ADDED
Binary file (6.25 kB).
 
videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-8.mp4 ADDED
Binary file (5.85 kB).