Nohobby's picture
Upload folder using huggingface_hub
9ffe179 verified
---
base_model:
- intervitens/mini-magnum-12b-v1.1
- UsernameJustAnother/Nemo-12B-Marlin-v2
- nothingiisreal/MN-12B-Celeste-V1.9
library_name: transformers
tags:
- mergekit
- merge
---
# merge
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [nothingiisreal/MN-12B-Celeste-V1.9](https://huggingface.co./nothingiisreal/MN-12B-Celeste-V1.9) as a base.
### Models Merged
The following models were included in the merge:
* [intervitens/mini-magnum-12b-v1.1](https://huggingface.co./intervitens/mini-magnum-12b-v1.1)
* [UsernameJustAnother/Nemo-12B-Marlin-v2](https://huggingface.co./UsernameJustAnother/Nemo-12B-Marlin-v2)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: nothingiisreal/MN-12B-Celeste-V1.9
- model: UsernameJustAnother/Nemo-12B-Marlin-v2
parameters:
density: [0.35, 0.45, 0.5, 0.55, 0.65, 0.55, 0.5, 0.45, 0.35]
weight: [0.165, 0.495, 0.495, 0.165, 0.165, 0.495, 0.495, 0.165]
- model: intervitens/mini-magnum-12b-v1.1
parameters:
density: [0.65, 0.55, 0.5, 0.45, 0.35, 0.45, 0.5, 0.55, 0.65]
weight: [0.495, 0.165, 0.165, 0.495, 0.495, 0.165, 0.165, 0.495]
merge_method: dare_ties
base_model: nothingiisreal/MN-12B-Celeste-V1.9
parameters:
normalize: false
int8_mask: true
dtype: bfloat16
```