geeknix committed on
Commit
54a6ecb
1 Parent(s): 8222b52

geeknix/geeknix_mistral_instruct_test_final

README.md CHANGED
@@ -1,6 +1,6 @@
 ---
 license: apache-2.0
-library_name: transformers
+library_name: peft
 tags:
 - trl
 - sft
@@ -11,7 +11,6 @@ datasets:
 model-index:
 - name: mistral_instruct_generation
   results: []
-pipeline_tag: text-generation
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -21,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7042
+- Loss: 0.4619
 
 ## Model description
 
@@ -41,7 +40,7 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 0.0002
-- train_batch_size: 4
+- train_batch_size: 2
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
@@ -51,15 +50,15 @@ The following hyperparameters were used during training:
 
 ### Training results
 
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 0.9489        | 0.0524 | 11   | 0.7042          |
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.5925        | 0.05  | 42   | 0.4619          |
 
 
 ### Framework versions
 
 - PEFT 0.11.1
 - Transformers 4.41.1
-- Pytorch 2.1.2
+- Pytorch 2.3.0+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c29836be3414ac0f4bfec343923b802b8aa66da7eba7e7a7af6819bcb06e940e
+oid sha256:44be91ee08e59ffe1ae92aee623b0e99f497350bec862f1e3a1f1ee1c04493ce
 size 109069176
runs/May31_23-54-15_25b160f765e5/events.out.tfevents.1717199669.25b160f765e5.405.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ba89f110f5da62d82063df37a5b317052d36b036b64df92f2c8b1cff2399ec6f
+size 7608
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5985da9aecb9b8ef82c07213f4a3e6d122a6b0bced8c1f6c192c323783b0e2ed
-size 5176
+oid sha256:7fc879a3c1f0fc1f23cf073eafcf74103a7156163da678a3fec0f9059860712a
+size 5112
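
The binary files in this commit (`adapter_model.safetensors`, `training_args.bin`, the TensorBoard event file) are stored as Git LFS pointer files: three `key value` lines giving the spec version, a `sha256` object ID, and the byte size of the real object. A minimal sketch of parsing such a pointer, assuming the simple three-field format shown above (the `parse_lfs_pointer` helper name is hypothetical, not part of any library):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

# Pointer contents taken verbatim from the training_args.bin change above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:7fc879a3c1f0fc1f23cf073eafcf74103a7156163da678a3fec0f9059860712a
size 5112"""

info = parse_lfs_pointer(pointer)
print(info["size"])  # -> 5112 (byte length of the real object in LFS storage)
```

A checkout without `git lfs` installed receives only these small pointer files, which is why the diff shows hashes and sizes rather than binary content.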