AdamGrzesik
/

Samantha-PL-AG-Mistral-7B-v0.2

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

AdamGrzesik commited on Mar 29

Commit

0b28f24

•

1 Parent(s): 900a6af

Update README.md

Files changed (1) hide show

README.md +3 -28

README.md CHANGED Viewed

@@ -3,7 +3,7 @@ base_model: alpindale/Mistral-7B-v0.2-hf
 tags:
 - generated_from_trainer
 model-index:
-- name: workspace/dolphin-2.8-mistral-7b
   results: []
 ---
@@ -33,7 +33,7 @@ chat_template: chatml
 dataset_prepared_path: last_run_prepared
 val_set_size: 0.001
-output_dir: /workspace/dolphin-2.8-mistral-7b
 sequence_len: 16384
 sample_packing: true
@@ -94,25 +94,12 @@ tokens:
 </details><br>
-# workspace/dolphin-2.8-mistral-7b
 This model is a fine-tuned version of [alpindale/Mistral-7B-v0.2-hf](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 3.8281
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -131,16 +118,4 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 4
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 3.6131        | 0.09  | 1    | 3.8281          |
-### Framework versions
-- Transformers 4.40.0.dev0
-- Pytorch 2.2.0+cu121
-- Datasets 2.18.0
-- Tokenizers 0.15.0

 tags:
 - generated_from_trainer
 model-index:
+- name: AdamGrzesik/Samantha-PL-AG-Mistral-7B-v0.2
   results: []
 ---
 dataset_prepared_path: last_run_prepared
 val_set_size: 0.001
+output_dir: /workspace/Samantha
 sequence_len: 16384
 sample_packing: true
 </details><br>
+# AdamGrzesik/Samantha-PL-AG-Mistral-7B-v0.2
 This model is a fine-tuned version of [alpindale/Mistral-7B-v0.2-hf](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf) on the None dataset.
 ### Training hyperparameters
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 4