vishal2304
commited on
Upload folder using huggingface_hub
Browse files- README.md +5 -15
- dqn_cartpole.zip +1 -1
- metadata.yaml +11 -0
- replay.mp4 +0 -0
- results.json +1 -1
README.md
CHANGED
@@ -1,24 +1,14 @@
|
|
1 |
|
2 |
-
|
3 |
-
---
|
4 |
-
tags:
|
5 |
-
- CartPole-v1
|
6 |
-
- q-learning
|
7 |
-
- reinforcement-learning
|
8 |
-
- custom-implementation
|
9 |
-
metrics:
|
10 |
-
- type: mean_reward
|
11 |
-
value: 172.93 +/- 49.55
|
12 |
-
library_name: stable-baselines3
|
13 |
-
---
|
14 |
-
|
15 |
-
|
16 |
# **DQN** Agent playing **CartPole-v1**
|
17 |
This is a trained model of a **DQN** agent playing **CartPole-v1**.
|
18 |
-
|
19 |
## Usage
|
20 |
```python
|
21 |
from stable_baselines3 import DQN
|
22 |
model = DQN.load("dqn_cartpole", env=env)
|
23 |
```
|
|
|
|
|
|
|
|
|
24 |
|
|
|
1 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
# **DQN** Agent playing **CartPole-v1**
|
3 |
This is a trained model of a **DQN** agent playing **CartPole-v1**.
|
4 |
+
|
5 |
## Usage
|
6 |
```python
|
7 |
from stable_baselines3 import DQN
|
8 |
model = DQN.load("dqn_cartpole", env=env)
|
9 |
```
|
10 |
+
|
11 |
+
## Evaluation Results
|
12 |
+
- Mean Reward: 165.61
|
13 |
+
- Standard Deviation: 36.78
|
14 |
|
dqn_cartpole.zip
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 100110
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f36fcf151eb821b4286ed8f9c31076a45362e72f91b1b691f4a36b081ba38548
|
3 |
size 100110
|
metadata.yaml
ADDED
@@ -0,0 +1,11 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
dataset:
|
2 |
+
name: CartPole-v1
|
3 |
+
type: gymnasium
|
4 |
+
license: mit
|
5 |
+
metrics:
|
6 |
+
mean_reward: 165.61 +/- 36.78
|
7 |
+
tags:
|
8 |
+
- CartPole-v1
|
9 |
+
- q-learning
|
10 |
+
- reinforcement-learning
|
11 |
+
- custom-implementation
|
replay.mp4
CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
|
|
results.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"env_id": "CartPole-v1", "mean_reward":
|
|
|
1 |
+
{"env_id": "CartPole-v1", "mean_reward": 165.61, "std_reward": 36.77577871371319, "eval_datetime": "2024-12-18T22:15:25.471841"}
|