DarshanDeshpande commited on
Commit
ed8e34e
1 Parent(s): 6f83f02

gemma_2b_social_reasoning_reward_model

Browse files
Files changed (3) hide show
  1. README.md +14 -14
  2. adapter_config.json +2 -2
  3. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -20,8 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.6194
24
- - Accuracy: 0.6766
25
 
26
  ## Model description
27
 
@@ -55,18 +55,18 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
58
- | 0.7925 | 0.24 | 10 | 0.6773 | 0.6182 |
59
- | 0.6702 | 0.48 | 20 | 0.6361 | 0.6532 |
60
- | 0.6146 | 0.72 | 30 | 0.6348 | 0.6673 |
61
- | 0.646 | 0.96 | 40 | 0.6261 | 0.6813 |
62
- | 0.5914 | 1.2 | 50 | 0.6224 | 0.6708 |
63
- | 0.5868 | 1.44 | 60 | 0.6430 | 0.6532 |
64
- | 0.5826 | 1.68 | 70 | 0.6401 | 0.6725 |
65
- | 0.5731 | 1.92 | 80 | 0.6309 | 0.6637 |
66
- | 0.5364 | 2.16 | 90 | 0.6383 | 0.6462 |
67
- | 0.5079 | 2.4 | 100 | 0.6386 | 0.6655 |
68
- | 0.4908 | 2.63 | 110 | 0.6389 | 0.6673 |
69
- | 0.5155 | 2.87 | 120 | 0.6387 | 0.6708 |
70
 
71
 
72
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.6131
24
+ - Accuracy: 0.6713
25
 
26
  ## Model description
27
 
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
58
+ | 0.7962 | 0.24 | 10 | 0.6976 | 0.6077 |
59
+ | 0.6657 | 0.48 | 20 | 0.6464 | 0.6217 |
60
+ | 0.6334 | 0.72 | 30 | 0.6330 | 0.6392 |
61
+ | 0.6434 | 0.96 | 40 | 0.6218 | 0.6427 |
62
+ | 0.5745 | 1.2 | 50 | 0.6119 | 0.6567 |
63
+ | 0.5813 | 1.44 | 60 | 0.6282 | 0.6445 |
64
+ | 0.5822 | 1.68 | 70 | 0.6350 | 0.6585 |
65
+ | 0.5914 | 1.92 | 80 | 0.6294 | 0.6550 |
66
+ | 0.5386 | 2.16 | 90 | 0.6341 | 0.6550 |
67
+ | 0.5142 | 2.4 | 100 | 0.6330 | 0.6567 |
68
+ | 0.4906 | 2.63 | 110 | 0.6313 | 0.6620 |
69
+ | 0.5079 | 2.87 | 120 | 0.6309 | 0.6637 |
70
 
71
 
72
  ### Framework versions
adapter_config.json CHANGED
@@ -21,8 +21,8 @@
21
  "rank_pattern": {},
22
  "revision": null,
23
  "target_modules": [
24
- "q_proj",
25
- "v_proj"
26
  ],
27
  "task_type": "SEQ_CLS",
28
  "use_dora": false,
 
21
  "rank_pattern": {},
22
  "revision": null,
23
  "target_modules": [
24
+ "v_proj",
25
+ "q_proj"
26
  ],
27
  "task_type": "SEQ_CLS",
28
  "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:665eea71182bd80b48a75e0aa4929d699910e7075c6516833710fd05e56fc2fb
3
  size 14763488
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e11c77d9338dce646bb81953c72b8ec68fc314cb3baf01992e6144695ede31e7
3
  size 14763488