nikniksen committed on
Commit
8249bb5
1 Parent(s): 514c5b7

nikniksen/tmjgpt-ft_v2

README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6001
+- Loss: 3.3234
 
 ## Model description
 
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.0965 | 1.0 | 1 | 4.2308 |
-| 1.0965 | 2.0 | 2 | 4.1805 |
-| 1.0598 | 3.0 | 3 | 4.0628 |
-| 0.9855 | 4.0 | 4 | 3.9432 |
-| 0.9225 | 5.0 | 5 | 3.8365 |
-| 0.8697 | 6.0 | 6 | 3.7503 |
-| 0.834 | 7.0 | 7 | 3.6862 |
-| 0.8089 | 8.0 | 8 | 3.6415 |
-| 0.7915 | 9.0 | 9 | 3.6136 |
-| 0.7801 | 10.0 | 10 | 3.6001 |
+| 1.9423 | 1.0 | 1 | 3.9858 |
+| 1.9404 | 2.0 | 2 | 3.9314 |
+| 1.974 | 3.0 | 3 | 3.8068 |
+| 1.8247 | 4.0 | 4 | 3.6860 |
+| 1.7804 | 5.0 | 5 | 3.5791 |
+| 1.6929 | 6.0 | 6 | 3.4900 |
+| 1.6617 | 7.0 | 7 | 3.4212 |
+| 1.5972 | 8.0 | 8 | 3.3715 |
+| 1.5936 | 9.0 | 9 | 3.3391 |
+| 1.5556 | 10.0 | 10 | 3.3234 |
 
 
 ### Framework versions
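Note on the README.md change: the final validation loss drops from 3.6001 to 3.3234. A minimal sketch of how to read that difference as perplexity, assuming the reported loss is a mean token-level cross-entropy in nats (the README does not say so explicitly) and that both runs were evaluated on the same data (also not stated, since the dataset is listed as unknown). The function name `perplexity` is illustrative only.

```python
import math

# Final validation losses copied from the old and new README tables.
old_val_loss = 3.6001
new_val_loss = 3.3234


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


old_ppl = perplexity(old_val_loss)
new_ppl = perplexity(new_val_loss)

# Fraction by which perplexity decreased between the two runs.
relative_drop = (old_ppl - new_ppl) / old_ppl

print(f"old perplexity: {old_ppl:.2f}")
print(f"new perplexity: {new_ppl:.2f}")
print(f"relative drop:  {relative_drop:.1%}")
```

If the two runs used different evaluation sets, the comparison above is not meaningful.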
adapter_config.json CHANGED
@@ -1,7 +1,7 @@
 {
 "alpha_pattern": {},
 "auto_mapping": null,
-"base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
+"base_model_name_or_path": null,
 "bias": "none",
 "fan_in_fan_out": false,
 "inference_mode": true,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bc124ea76613c67eb146acb768344d64be4559265893317200bd111b234afe2c
-size 8397056
+oid sha256:73b46bd249ad6aa13afe65bed1fb6e4136a1fe20a96912644200fc842f816fa5
+size 8398144
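Note on the adapter_model.safetensors change: the diff shows only the Git LFS pointer file, not the weights themselves. A small sketch of how such pointers can be parsed and compared, using the two pointers from this hunk. The helper `parse_lfs_pointer` is a hypothetical name introduced here, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


# Pointer contents copied from the old and new sides of the hunk.
old_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:bc124ea76613c67eb146acb768344d64be4559265893317200bd111b234afe2c
size 8397056
"""
new_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:73b46bd249ad6aa13afe65bed1fb6e4136a1fe20a96912644200fc842f816fa5
size 8398144
"""

old = parse_lfs_pointer(old_pointer)
new = parse_lfs_pointer(new_pointer)

content_changed = old["oid"] != new["oid"]
size_delta = new["size"] - old["size"]

print(f"content changed: {content_changed}")
print(f"size delta: {size_delta} bytes")
```

The pointer alone does not explain why the file size changed; that would require inspecting the safetensors files themselves.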
runs/Mar16_21-17-20_2ce4a61b58cb/events.out.tfevents.1710623845.2ce4a61b58cb.459.1 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3d12ea1853e33e53d010633074180b54849a09953b19c2ea444d9bbdc238d1ae
+size 10283
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60d3a37ed0fccd6452738c94666b0ccc8781ccf6e6210802d203fa0c0090caf6
+oid sha256:1575bd5a612d0247c56162721f5b47b655da2ddc4c136e3b52bcd2c34ce4ac79
 size 4856
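Note on the adapter_config.json change in this commit: `base_model_name_or_path` goes from "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ" to null. Code that reads the base model ID from the adapter config will likely no longer find it and may need the base model supplied explicitly. A minimal sketch of a fallback, assuming the base model is still the one named in the previous config and in the README. The helper `resolve_base_model` and the constant `FALLBACK_BASE_MODEL` are hypothetical names, not part of any library.

```python
import json

# Assumption: the base model is unchanged from the previous adapter_config.json.
FALLBACK_BASE_MODEL = "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ"


def resolve_base_model(config_text: str, fallback: str = FALLBACK_BASE_MODEL) -> str:
    """Return the base model ID from an adapter config, or the fallback if it is null or missing."""
    config = json.loads(config_text)
    return config.get("base_model_name_or_path") or fallback


# The fields visible in the new side of the hunk (the rest of the file is not shown in the diff).
new_config = """
{
  "alpha_pattern": {},
  "auto_mapping": null,
  "base_model_name_or_path": null,
  "bias": "none",
  "fan_in_fan_out": false,
  "inference_mode": true
}
"""

base_model_id = resolve_base_model(new_config)
print(base_model_id)
```

The resolved ID can then be used to load the base model first and attach the adapter to it, rather than relying on the config to name the base model.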