Commit 0bc0b4b by thegallier
Parent: 3128f47

Training in progress, epoch 5

README.md ADDED
@@ -0,0 +1,55 @@
+---
+base_model: mistralai/Mistral-7B-Instruct-v0.3
+library_name: peft
+license: apache-2.0
+tags:
+- trl
+- sft
+- generated_from_trainer
+model-index:
+- name: mistral-v0.3-Fine-tunedByTheGallier
+  results: []
+---
+
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+
+# mistral-v0.3-Fine-tunedByTheGallier
+
+This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) on the None dataset.
+
+## Model description
+
+More information needed
+
+## Intended uses & limitations
+
+More information needed
+
+## Training and evaluation data
+
+More information needed
+
+## Training procedure
+
+### Training hyperparameters
+
+The following hyperparameters were used during training:
+- learning_rate: 0.0002
+- train_batch_size: 4
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 16
+- total_train_batch_size: 64
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: cosine
+- training_steps: 250
+- mixed_precision_training: Native AMP
+
+### Framework versions
+
+- PEFT 0.12.0
+- Transformers 4.44.0
+- Pytorch 2.3.1+cu121
+- Datasets 2.21.0
+- Tokenizers 0.19.1
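
Reviewer note: the hyperparameters in the README are internally consistent, which the sketch below checks. It shows how `total_train_batch_size: 64` follows from `train_batch_size: 4` and `gradient_accumulation_steps: 16`, and what a cosine schedule starting at `learning_rate: 0.0002` would look like over `training_steps: 250`. The single-device setup, the zero warmup, and the function names `effective_batch_size` and `cosine_lr` are assumptions made for illustration; the commit does not state them, and this is not the Trainer's own code.

```python
import math

# Values copied from the "Training hyperparameters" list in the README diff.
LEARNING_RATE = 2e-4
TRAIN_BATCH_SIZE = 4
GRADIENT_ACCUMULATION_STEPS = 16
TRAINING_STEPS = 250


def effective_batch_size(per_device: int, accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Samples consumed per optimizer step.

    num_devices=1 is an assumption; the README does not list a device count.
    """
    return per_device * accumulation_steps * num_devices


def cosine_lr(step: int, base_lr: float = LEARNING_RATE,
              total_steps: int = TRAINING_STEPS, warmup_steps: int = 0) -> float:
    """Cosine decay from base_lr to zero, with optional linear warmup.

    warmup_steps=0 is an assumption; the README does not mention warmup.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    batch = effective_batch_size(TRAIN_BATCH_SIZE, GRADIENT_ACCUMULATION_STEPS)
    print("effective batch size:", batch)
    print("samples seen over the run:", batch * TRAINING_STEPS)
    for step in (0, 125, 250):
        print(f"step {step}: lr = {cosine_lr(step):.6g}")
```

Under these assumptions the product 4 x 16 reproduces the reported total of 64, so the card's numbers agree with a single-device run.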
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9d3a4e07d83914136b5000bd782a648d51f3a340537e6de179d9ddae66dd4b89
+oid sha256:01fe3b741bc0304f147df85a4c9ee3337754bd0a8a799c848c7d2c61bd7ab0ab
 size 13648432
runs/Aug18_20-05-20_587671eac8f6/events.out.tfevents.1724012041.587671eac8f6.3895.1 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4ea720f3d5314d0af8c6437656b1ae0ddc5f2170dd4fadc8ea26ae455cb8934a
-size 7639
+oid sha256:13b42a400d915dcf30a8be6fded21a5dcb028b55a95cd7066efaed5a909f23f9
+size 9526
runs/Aug18_21-02-08_587671eac8f6/events.out.tfevents.1724014972.587671eac8f6.3895.2 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7d689aa014a28f1dc9d28827990a2058c2a7bf191aaeb4926754c18c16f8cdec
+size 5982
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6e66ad37135a32156acdfabd9d053c4acdaa1670f9722be5e703f4da84e81fb5
+oid sha256:fe6bc9e3f7c5a853544b0f1d9a3c390fe7d7f25d4418e107e9a6a0a78a7ab9ab
 size 5496
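
Reviewer note: the binary files in this commit are stored as Git LFS pointers, so each diff only shows a changed `oid` and, where applicable, a changed `size`. The sketch below is one way to parse such a pointer and compare the two sides of a diff. The names `LfsPointer`, `parse_lfs_pointer`, and `content_changed` are hypothetical helpers written for this note, not part of any Git LFS tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LfsPointer:
    version: str         # spec URL from the "version" line
    hash_algorithm: str  # e.g. "sha256"
    digest: str          # hex digest of the real file content
    size: int            # size of the real file in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return LfsPointer(fields["version"], algorithm, digest, int(fields["size"]))


def content_changed(old: LfsPointer, new: LfsPointer) -> bool:
    """True when the digest or the size differs between two pointers."""
    return (old.digest, old.size) != (new.digest, new.size)


if __name__ == "__main__":
    # Both sides of the adapter_model.safetensors diff in this commit.
    old = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:9d3a4e07d83914136b5000bd782a648d51f3a340537e6de179d9ddae66dd4b89\n"
        "size 13648432\n"
    )
    new = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:01fe3b741bc0304f147df85a4c9ee3337754bd0a8a799c848c7d2c61bd7ab0ab\n"
        "size 13648432\n"
    )
    print("same size:", old.size == new.size)
    print("content changed:", content_changed(old, new))
```

For `adapter_model.safetensors` the size stays at 13648432 bytes while the digest changes, which is what one would expect when adapter weights are updated without changing their shapes.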