Update README.md
README.md
CHANGED
@@ -6,7 +6,49 @@ language:
 metrics:
 - accuracy
 base_model:
-- sthenno
+- sthenno/tempesthenno-icy-0130
 ---
 
-sthenno/tempesthenno-kto-0205-ckpt80
+sthenno/tempesthenno-kto-0205-ckpt80
+
+# tempesthenno-icy-0130
+
+data:image/s3,"s3://crabby-images/c841f/c841f0e423c630aecac66c2217eaae38ed053f9b" alt=""
+
+Image source: [Azriel | Fandom](https://fairy-tail-fanfiction.fandom.com/es/wiki/Azriel)
+
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+## Merge Details
+### Merge Method
+
+This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using sthenno/tempesthenno-nuslerp-0124 as a base.
+
+### Models Merged
+
+The following models were included in the merge:
+* sthenno/tempesthenno-icy-0130-01
+* sthenno/tempesthenno-icy-0130-02
+* sthenno/tempesthenno-icy-0130-03
+
+### Configuration
+
+The following YAML configuration was used to produce this model:
+
+```yaml
+name: tempesthenno-icy-0130
+merge_method: sce
+parameters:
+  select_topk: 0.8
+  normalize: true
+dtype: float32
+out_dtype: bfloat16
+base_model: sthenno/tempesthenno-nuslerp-0124
+tokenizer:
+  source: base
+chat_template: chatml
+models:
+- model: sthenno/tempesthenno-icy-0130-01
+- model: sthenno/tempesthenno-icy-0130-02
+- model: sthenno/tempesthenno-icy-0130-03
+```
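
The `select_topk: 0.8` value in the added configuration is easier to picture with a toy example. The sketch below is only an illustration, not mergekit's code: it assumes the selection step of SCE keeps the fraction of parameter positions whose deltas from the base model vary most across the merged models, and masks out the rest. The function name `select_topk_mask` and the toy numbers are hypothetical, and the real implementation works on tensors and may handle ties and zero-variance positions differently.

```python
from statistics import pvariance


def select_topk_mask(deltas, select_topk):
    """Return a 0/1 mask that keeps the highest-variance positions.

    deltas: one list per model, holding that model's parameter deltas
            from the base model (all lists have the same length).
    select_topk: fraction of positions to keep, between 0 and 1.
    """
    n_positions = len(deltas[0])
    # Population variance of each position, taken across the models.
    variances = [pvariance([d[i] for d in deltas]) for i in range(n_positions)]
    # Number of positions to keep (truncated toward zero).
    k = int(n_positions * select_topk)
    # Indices of the k positions with the largest variance.
    keep = sorted(range(n_positions), key=lambda i: variances[i], reverse=True)[:k]
    mask = [0] * n_positions
    for i in keep:
        mask[i] = 1
    return mask


# Toy deltas for three hypothetical fine-tunes over five parameters.
deltas = [
    [0.10, 0.00, -0.30, 0.02, 0.50],
    [0.12, 0.40, 0.30, 0.02, -0.50],
    [0.11, -0.40, 0.00, 0.02, 0.00],
]
mask = select_topk_mask(deltas, select_topk=0.8)
print(mask)  # [1, 1, 1, 0, 1]
```

With five positions and `select_topk=0.8`, four positions are kept. The fourth position is dropped because all three toy models agree on it exactly, so its variance is zero.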