nelkh committed on
Commit
f497eb0
1 Parent(s): ec2376c

nelkh/pgd_lora_8bits_mistral_1_r4_e4_b3

README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.4218
+ - Loss: 0.9165
 
  ## Model description
 
@@ -49,15 +49,15 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 1.2146 | 1.0 | 90 | 0.4449 |
- | 0.4336 | 2.0 | 180 | 0.4277 |
- | 0.4216 | 3.0 | 270 | 0.4245 |
- | 0.4146 | 4.0 | 360 | 0.4218 |
+ | 1.5061 | 1.0 | 90 | 0.9588 |
+ | 0.9073 | 2.0 | 180 | 0.9308 |
+ | 0.8847 | 3.0 | 270 | 0.9168 |
+ | 0.8653 | 4.0 | 360 | 0.9165 |
 
 
  ### Framework versions
 
- - PEFT 0.11.1
+ - PEFT 0.12.0
  - Transformers 4.42.4
  - Pytorch 2.3.1+cu121
  - Datasets 2.20.0
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:6c6fdabb38a768a4d35be92a860b3ee964d242b4c66f10368dd9f8c7d574cad6
+ oid sha256:684c26c5db5170ff94b20ff350e0a15a44f31b88647768028e86c62a90bed534
  size 6832592
runs/Jul30_12-38-55_adf30a86b1ac/events.out.tfevents.1722343144.adf30a86b1ac.5272.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:25d32b44b0fd08e9b5b97e73a75857d994b4a5185e8fc07e46fffd5faddd4b1a
+ size 7640
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:521736074d9c7f8dd468dfb97c0e1926b26a0af499487f5d0b2a017d360c089d
+ oid sha256:e5c9f70de440a0d56b7a773010b0d8c60e394643c4d249bd7aea9bbaa56e3cf7
  size 5112
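
The README change in this commit swaps one results table for another, so the two runs can be compared epoch by epoch. The following is a minimal sketch of that arithmetic, not part of the repository: the numbers are copied from the removed and added table rows, while the helper names `per_epoch_delta` and `last_epoch_gain` are hypothetical and exist only for illustration.

```python
# Validation loss per epoch, copied from the README tables in this commit.
old_val_loss = {1: 0.4449, 2: 0.4277, 3: 0.4245, 4: 0.4218}  # removed rows
new_val_loss = {1: 0.9588, 2: 0.9308, 3: 0.9168, 4: 0.9165}  # added rows


def per_epoch_delta(old, new):
    """Return new - old for every epoch present in both runs (hypothetical helper)."""
    return {
        epoch: round(new[epoch] - old[epoch], 4)
        for epoch in sorted(old)
        if epoch in new
    }


def last_epoch_gain(losses):
    """Return how much validation loss dropped between the last two epochs."""
    epochs = sorted(losses)
    return round(losses[epochs[-2]] - losses[epochs[-1]], 4)


deltas = per_epoch_delta(old_val_loss, new_val_loss)
for epoch, delta in deltas.items():
    print(f"epoch {epoch}: validation loss changed by {delta:+.4f}")

print("old run, epoch 3 -> 4 drop:", last_epoch_gain(old_val_loss))
print("new run, epoch 3 -> 4 drop:", last_epoch_gain(new_val_loss))
```

The diff does not show whether the two runs used the same data or hyperparameters (the changed `training_args.bin` hash suggests the arguments may differ), so the deltas describe the reported numbers only and should not be read as a like-for-like comparison.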