File size: 1,129 Bytes
2cc0b8e
d6afdd9
 
 
 
 
 
 
 
2cc0b8e
d6afdd9
 
2dbd6ff
 
d6afdd9
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
base_model:
- SanjiWatsuki/Kunoichi-7B
- SanjiWatsuki/Kunoichi-DPO-v2-7B
library_name: transformers
tags:
- mergekit
- merge

---
# output-model-directory

kuno-kunoichi-v1-DPO-v2-SLERP-7B is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
I'm hoping that the result is more robust against errors, as the two models likely implement comparable reasoning at least somewhat differently.

## Merge Details
### Merge Method

This model was merged using the SLERP merge method.

### Models Merged

The following models were included in the merge:
* [SanjiWatsuki/Kunoichi-7B](https://huggingface.co./SanjiWatsuki/Kunoichi-7B)
* [SanjiWatsuki/Kunoichi-DPO-v2-7B](https://huggingface.co./SanjiWatsuki/Kunoichi-DPO-v2-7B)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
slices:
  - sources:
    - model: SanjiWatsuki/Kunoichi-7B
      layer_range: [0,32]
    - model: SanjiWatsuki/Kunoichi-DPO-v2-7B
      layer_range: [0,32]
merge_method: slerp
base_model: SanjiWatsuki/Kunoichi-7B
parameters:
  t:
    - value: 0.5
dtype: float16

```