Update README.md
README.md
@@ -4,14 +4,14 @@ base_model: KT-AI/midm-bitext-S-7B-inst-v1
 tags:
 - generated_from_trainer
 model-index:
-- name:
+- name: midm-7B-nsmc
   results: []
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-#
+# midm-7B-nsmc
 
 This model is a fine-tuned version of [KT-AI/midm-bitext-S-7B-inst-v1](https://huggingface.co/KT-AI/midm-bitext-S-7B-inst-v1) on an unknown dataset.
 
@@ -46,6 +46,14 @@ The following hyperparameters were used during training:
 
 ### Training results
 
+- global_step=300
+- training_loss=1.1095316060384115
+- metrics={'train_runtime': 1012.8423,
+'train_samples_per_second': 0.592,
+'train_steps_per_second': 0.296,
+'total_flos': 9315508499251200.0,
+'train_loss': 1.1095316060384115,
+'epoch': 0.3}
 
 
 ### Framework versions
@@ -54,3 +62,5 @@ The following hyperparameters were used during training:
 - Pytorch 2.1.0+cu118
 - Datasets 2.15.0
 - Tokenizers 0.15.0
+
+### Accuracy
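The training metrics added in this commit can be cross-checked against each other with a little arithmetic. The sketch below is not part of the commit. It copies the reported values and assumes that the two throughput figures are averages over the whole run; the variable names are illustrative only, and the implied epoch size is an estimate, since the card describes the dataset as unknown.

```python
# Values copied from the "Training results" lines added in this commit.
global_step = 300
metrics = {
    "train_runtime": 1012.8423,          # seconds
    "train_samples_per_second": 0.592,
    "train_steps_per_second": 0.296,
    "epoch": 0.3,
}

# Average optimizer steps per second over the run.
steps_per_second = global_step / metrics["train_runtime"]

# Approximate number of samples processed (the logged rate is rounded).
samples_seen = metrics["train_samples_per_second"] * metrics["train_runtime"]

# Samples consumed per optimizer step, i.e. the effective batch size.
samples_per_step = (
    metrics["train_samples_per_second"] / metrics["train_steps_per_second"]
)

# Estimated size of one full epoch, assuming "epoch" is the fraction completed.
implied_epoch_size = samples_seen / metrics["epoch"]

print(f"steps/s recomputed:   {steps_per_second:.3f}")
print(f"samples seen (approx): {samples_seen:.0f}")
print(f"samples per step:      {samples_per_step:.1f}")
print(f"implied epoch size:    {implied_epoch_size:.0f}")
```

Under these assumptions the numbers agree with one another: 300 steps in about 1013 seconds matches the logged 0.296 steps per second, and the run processed roughly 600 samples at 2 samples per step, which at 0.3 of an epoch suggests a training set of about 2,000 samples.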