--- base_model: - O1-OPEN/OpenO1-LLama-8B-v0.1 - THUDM/LongWriter-llama3.1-8b library_name: transformers tags: - mergekit - merge --- # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [O1-OPEN/OpenO1-LLama-8B-v0.1](https://huggingface.co./O1-OPEN/OpenO1-LLama-8B-v0.1) as a base. ### Models Merged The following models were included in the merge: * [THUDM/LongWriter-llama3.1-8b](https://huggingface.co./THUDM/LongWriter-llama3.1-8b) ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: O1-OPEN/OpenO1-LLama-8B-v0.1 parameters: density: 0.7 weight: 0.7 - model: THUDM/LongWriter-llama3.1-8b parameters: density: 0.3 weight: 0.3 merge_method: ties base_model: O1-OPEN/OpenO1-LLama-8B-v0.1 parameters: normalize: false int8_mask: true dtype: bfloat16 ```