--- base_model: - TheDrummer/Rocinante-12B-v1 - Trappu/Nemo-Picaro-12B - anthracite-org/magnum-v4-12b library_name: transformers tags: - mergekit - merge --- # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Trappu/Nemo-Picaro-12B](https://huggingface.co./Trappu/Nemo-Picaro-12B) as a base. ### Models Merged The following models were included in the merge: * [TheDrummer/Rocinante-12B-v1](https://huggingface.co./TheDrummer/Rocinante-12B-v1) * [anthracite-org/magnum-v4-12b](https://huggingface.co./anthracite-org/magnum-v4-12b) ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: Trappu/Nemo-Picaro-12B parameters: density: 0.7 weight: 0.4 - model: anthracite-org/magnum-v4-12b parameters: density: 0.3 weight: 0.3 - model: TheDrummer/Rocinante-12B-v1 parameters: density: 0.5 weight: 0.3 merge_method: ties base_model: Trappu/Nemo-Picaro-12B parameters: normalize: true int8_mask: true dtype: bfloat16 chat_template: "chatml" tokenizer: source: union ```