---
base_model:
- BarraHome/Mistroll-7B-v2.2
- ClaudioItaly/Evolutionstory
library_name: transformers
tags:
- mergekit
- merge
license: mit
---
# merge

I finally think I managed to make a model with strong writing skills. Given a prompt, it stays very coherent with the story, and it also has solid RAG capabilities.

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

The model "ClaudioItaly/Evolutionstory-7B-v2.2" achieved interesting results on several metrics, but also shows some areas for improvement. Here is an analysis of the main findings and their implications:

**Strengths:**

* **IFEval (0-Shot):** Scored a very solid 48.14 in strict accuracy. This indicates the model handles instruction-following tasks well without prior examples, demonstrating good immediate comprehension.
* **BBH (3-Shot):** The score of 31.62 on this 3-shot benchmark (where the model receives a few examples before responding) suggests the model can effectively leverage additional context to improve performance.

**Areas for Improvement:**

* **Math and Complex Reasoning (MATH Lvl 5, 4-Shot):** A score of 6.42 on this advanced math test highlights that the model struggles with complex logic and math problems, which is typical of general language models that are not optimized for numerical or structured problem solving.

## Merge Details

### Merge Method

This model was merged using the SLERP merge method.
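As a rough illustration of what SLERP does, the sketch below interpolates between two weight tensors along the arc of a hypersphere rather than along a straight line, falling back to linear interpolation when the tensors are nearly colinear. This is a minimal NumPy sketch of the general technique, not mergekit's actual implementation; the `slerp` helper name and `eps` threshold are assumptions.

```python
import numpy as np

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two weight tensors.

    t=0 returns v0 (the base model), t=1 returns the other model (v1).
    Falls back to plain linear interpolation when the two tensors point
    in nearly the same (or opposite) direction.
    """
    v0_flat = v0.ravel().astype(np.float64)
    v1_flat = v1.ravel().astype(np.float64)
    # Normalize to compare directions only.
    v0_n = v0_flat / (np.linalg.norm(v0_flat) + eps)
    v1_n = v1_flat / (np.linalg.norm(v1_flat) + eps)
    dot = np.clip(np.dot(v0_n, v1_n), -1.0, 1.0)
    if abs(dot) > 1.0 - eps:
        # Nearly parallel: lerp is numerically safer and equivalent here.
        return ((1.0 - t) * v0_flat + t * v1_flat).reshape(v0.shape)
    theta = np.arccos(dot)                    # angle between the tensors
    sin_theta = np.sin(theta)
    w0 = np.sin((1.0 - t) * theta) / sin_theta
    w1 = np.sin(t * theta) / sin_theta
    return (w0 * v0_flat + w1 * v1_flat).reshape(v0.shape)
```

Compared with a plain weighted average, interpolating along the arc preserves the magnitude characteristics of the blended weights, which is the usual motivation for choosing SLERP over linear merging.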
### Models Merged

The following models were included in the merge:

* [BarraHome/Mistroll-7B-v2.2](https://huggingface.co./BarraHome/Mistroll-7B-v2.2)
* [ClaudioItaly/Evolutionstory](https://huggingface.co./ClaudioItaly/Evolutionstory)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: BarraHome/Mistroll-7B-v2.2
  - model: ClaudioItaly/Evolutionstory
merge_method: slerp
base_model: ClaudioItaly/Evolutionstory
dtype: bfloat16
parameters:
  t: [0, 0.5, 1, 0.5, 0] # V-shaped curve: Evolutionstory for input & output, Mistroll in the middle layers
```
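The `t` list above defines a per-layer blend schedule rather than a single mixing ratio: the anchor values are spread across the layer stack and interpolated in between, so the outermost layers keep the base model (t=0) while the middle layers lean toward the other model (t=1). The sketch below shows that mapping under the assumption of evenly spaced anchors; the `layer_t` helper is hypothetical, not part of mergekit's API.

```python
import numpy as np

# Anchor values from the config's `t` list, assumed to be spaced
# evenly over the depth of the network.
T_ANCHORS = [0.0, 0.5, 1.0, 0.5, 0.0]

def layer_t(layer_idx, num_layers, anchors=T_ANCHORS):
    """Blend weight for one layer: 0 keeps the base model
    (Evolutionstory), 1 takes the other model (Mistroll)."""
    xs = np.linspace(0.0, 1.0, len(anchors))      # anchor positions
    pos = layer_idx / max(num_layers - 1, 1)      # this layer's position
    return float(np.interp(pos, xs, anchors))     # piecewise-linear curve

# For a 32-layer Mistral-7B stack: input and output layers stay on the
# base model, the middle of the network is mostly the merged-in model.
weights = [layer_t(i, 32) for i in range(32)]
```

This V-shaped schedule is a common mergekit pattern: embeddings and output head stay close to one parent for stable tokenization and decoding, while the middle layers, where much of the "style" lives, take more from the other parent.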