Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp


OpenHermes-2.5-neural-chat-v3-3-Slerp

This is the model card for OpenHermes-2.5-neural-chat-v3-3-Slerp. I used mergekit to merge teknium/OpenHermes-2.5-Mistral-7B and Intel/neural-chat-7b-v3-3 with the slerp merge method.

Prompt Templates

You can use these prompt templates, but I recommend using ChatML.

ChatML (OpenHermes-2.5-Mistral-7B):

<|im_start|>system
{system}<|im_end|>
<|im_start|>user
{user}<|im_end|>
<|im_start|>assistant
{assistant}<|im_end|>
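
A minimal sketch of how the ChatML template above might be filled in at inference time. The helper name `build_chatml_prompt` is hypothetical and not part of any library. It assumes the assistant turn is left open, so that the model writes the reply and ends it with `<|im_end|>` itself.

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Fill the ChatML template, leaving the assistant turn open for generation.

    Hypothetical helper for illustration; not part of any library.
    """
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )


prompt = build_chatml_prompt("You are a helpful assistant.", "What is a model merge?")
print(prompt)
```

When generating, `<|im_end|>` is the natural stop sequence for this format.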

neural-chat-7b-v3-3:

### System:
{system}
### User:
{user}
### Assistant:
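
The neural-chat template can be filled in the same way. This is a minimal sketch; `build_neural_chat_prompt` is a hypothetical helper name, and the trailing newline after `### Assistant:` is an assumption about where generation should start.

```python
def build_neural_chat_prompt(system: str, user: str) -> str:
    """Fill the neural-chat-7b-v3-3 template shown above.

    Hypothetical helper for illustration; the trailing newline after the
    assistant header is an assumption, not something the template specifies.
    """
    return (
        f"### System:\n{system}\n"
        f"### User:\n{user}\n"
        "### Assistant:\n"
    )


prompt = build_neural_chat_prompt("You are a helpful assistant.", "What is a model merge?")
print(prompt)
```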

YAML config to reproduce

slices:
  - sources:
      - model: teknium/OpenHermes-2.5-Mistral-7B
        layer_range: [0, 32]
      - model: Intel/neural-chat-7b-v3-3
        layer_range: [0, 32]
merge_method: slerp
base_model: mistralai/Mistral-7B-v0.1
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5 # fallback for rest of tensors
dtype: bfloat16
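
To illustrate what the config above asks for, here is a minimal pure-Python sketch of spherical linear interpolation (slerp) between two weight vectors, plus one plausible way a five-value `t` list could be spread across 32 layers. This is not mergekit's code: the function names `slerp` and `layer_t` are hypothetical, the piecewise-linear spreading of the `t` list is an assumption about how such a list is applied, and which model sits at `t = 0` versus `t = 1` is decided by mergekit and is not asserted here.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between vectors v0 (t=0) and v1 (t=1)."""
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    if norm0 == 0.0 or norm1 == 0.0:
        # No direction to interpolate along: fall back to linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    # Cosine of the angle between the two vectors, clamped for acos.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        # Nearly parallel vectors: slerp degenerates, so use linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer_idx, num_layers):
    """Piecewise-linear interpolation of anchor values over layer positions.

    Assumption: the anchors are spaced evenly from the first to the last layer.
    """
    if num_layers == 1:
        return anchors[0]
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


self_attn_t = [0, 0.5, 0.3, 0.7, 1]  # values from the config above
for layer in (0, 8, 16, 24, 31):
    print(layer, round(layer_t(self_attn_t, layer, 32), 3))

print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))
```

Under these assumptions the `self_attn` and `mlp` lists mirror each other, so the two tensor groups lean toward opposite models at the first and last layers, while all other tensors use the fixed fallback of 0.5.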

Quantized versions

Quantized versions of this model are available thanks to TheBloke.

GPTQ

TheBloke/OpenHermes-2.5-neural-chat-v3-3-Slerp-GPTQ

GGUF

TheBloke/OpenHermes-2.5-neural-chat-v3-3-Slerp-GGUF

AWQ

TheBloke/OpenHermes-2.5-neural-chat-v3-3-Slerp-AWQ

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	71.38
ARC (25-shot)	68.09
HellaSwag (10-shot)	86.2
MMLU (5-shot)	64.26
TruthfulQA (0-shot)	62.78
Winogrande (5-shot)	79.16
GSM8K (5-shot)	67.78
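
As a quick sanity check, the reported average can be recomputed from the six benchmark scores in the table. This sketch assumes `Avg.` is the unweighted mean of those six values.

```python
# Scores copied from the table above.
scores = {
    "ARC (25-shot)": 68.09,
    "HellaSwag (10-shot)": 86.2,
    "MMLU (5-shot)": 64.26,
    "TruthfulQA (0-shot)": 62.78,
    "Winogrande (5-shot)": 79.16,
    "GSM8K (5-shot)": 67.78,
}

# Unweighted mean over the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 71.38
```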



Model size (Safetensors): 7.24B params
Tensor type: BF16


Evaluation results (self-reported)

Benchmark	Metric	Split	Value
AI2 Reasoning Challenge (25-shot)	normalized accuracy	test	68.09
HellaSwag (10-shot)	normalized accuracy	validation	86.20
MMLU (5-shot)	accuracy	test	64.26
TruthfulQA (0-shot)	mc2	validation	62.78
Winogrande (5-shot)	accuracy	validation	79.16
GSM8k (5-shot)	accuracy	test	67.78
