Upload folder using huggingface_hub

Browse files

Files changed (7) hide show

README.md +5 -6
README2.md +45 -0
mergekit_config.yml +1 -1
model-00001-of-00004.safetensors +3 -0
model-00002-of-00004.safetensors +3 -0
model-00003-of-00004.safetensors +3 -0
model-00004-of-00004.safetensors +3 -0

README.md CHANGED Viewed

@@ -1,10 +1,9 @@
 ---
 base_model:
-- Replete-AI/Replete-LLM-Qwen2-7b_Beta-Preview
-- MaziyarPanahi/calme-2.8-qwen2-7b
 - arcee-ai/Arcee-Spark
 - Weyaxi/Einstein-v7-Qwen2-7B
-- Qwen/Qwen2-7B
 library_name: transformers
 tags:
 - mergekit
@@ -19,7 +18,7 @@ This is a model created by merging several powerful language models:
 * **Base Model:** [Qwen/Qwen2-7B](https://huggingface.co/Qwen/Qwen2-7B)
 * **Merge Stock:**
-  * [Replete-AI/Replete-LLM-Qwen2-7b_Beta-Preview](https://huggingface.co/Replete-AI/Replete-LLM-Qwen2-7b_Beta-Preview)
   * [MaziyarPanahi/calme-2.8-qwen2-7b](https://huggingface.co/MaziyarPanahi/calme-2.8-qwen2-7b)
   * [arcee-ai/Arcee-Spark](https://huggingface.co/arcee-ai/Arcee-Spark)
   * [Weyaxi/Einstein-v7-Qwen2-7B](https://huggingface.co/Weyaxi/Einstein-v7-Qwen2-7B)
@@ -32,7 +31,7 @@ FocusMix inherits the strengths of its component models, resulting in a model wi
 **Purpose:** aims to provide a powerful and versatile language model that excels in:
-* **Task-Specific Instructions:**  FocusMix can effectively follow specific instructions and complete tasks with higher accuracy.
 * **Complex Reasoning:** The model can handle intricate prompts requiring logical deduction and problem-solving.
 * **Diverse Knowledge Domains:**  FocusMix can engage in conversations and provide information across a wide range of topics.
@@ -44,7 +43,7 @@ The following YAML configuration was used to produce this model:
 merge_method: model_stock
 base_model: Qwen/Qwen2-7B
 models:
-  - model: Replete-AI/Replete-LLM-Qwen2-7b_Beta-Preview
   - model: arcee-ai/Arcee-Spark
   - model: Weyaxi/Einstein-v7-Qwen2-7B
   - model: MaziyarPanahi/calme-2.8-qwen2-7b

 ---
 base_model:
+- Replete-AI/Replete-LLM-Qwen2-7b
 - arcee-ai/Arcee-Spark
 - Weyaxi/Einstein-v7-Qwen2-7B
+- MaziyarPanahi/calme-2.8-qwen2-7b
 library_name: transformers
 tags:
 - mergekit
 * **Base Model:** [Qwen/Qwen2-7B](https://huggingface.co/Qwen/Qwen2-7B)
 * **Merge Stock:**
+  * [Replete-AI/Replete-LLM-Qwen2-7b](https://huggingface.co/Replete-AI/Replete-LLM-Qwen2-7b)
   * [MaziyarPanahi/calme-2.8-qwen2-7b](https://huggingface.co/MaziyarPanahi/calme-2.8-qwen2-7b)
   * [arcee-ai/Arcee-Spark](https://huggingface.co/arcee-ai/Arcee-Spark)
   * [Weyaxi/Einstein-v7-Qwen2-7B](https://huggingface.co/Weyaxi/Einstein-v7-Qwen2-7B)
 **Purpose:** aims to provide a powerful and versatile language model that excels in:
+* **Task-Specific Instructions:**  FocusMix can effectively follow specific instructions and complete tasks with high accuracy.
 * **Complex Reasoning:** The model can handle intricate prompts requiring logical deduction and problem-solving.
 * **Diverse Knowledge Domains:**  FocusMix can engage in conversations and provide information across a wide range of topics.
 merge_method: model_stock
 base_model: Qwen/Qwen2-7B
 models:
+  - model: Replete-AI/Replete-LLM-Qwen2-7b
   - model: arcee-ai/Arcee-Spark
   - model: Weyaxi/Einstein-v7-Qwen2-7B
   - model: MaziyarPanahi/calme-2.8-qwen2-7b

README2.md ADDED Viewed

	@@ -0,0 +1,45 @@

+---
+base_model:
+- Replete-AI/Replete-LLM-Qwen2-7b
+- Qwen/Qwen2-7B
+- MaziyarPanahi/calme-2.8-qwen2-7b
+- arcee-ai/Arcee-Spark
+- Weyaxi/Einstein-v7-Qwen2-7B
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# Qwen2-7B-FocusMix
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2-7B](https://huggingface.co/Qwen/Qwen2-7B) as a base.
+### Models Merged
+The following models were included in the merge:
+* [Replete-AI/Replete-LLM-Qwen2-7b](https://huggingface.co/Replete-AI/Replete-LLM-Qwen2-7b)
+* [MaziyarPanahi/calme-2.8-qwen2-7b](https://huggingface.co/MaziyarPanahi/calme-2.8-qwen2-7b)
+* [arcee-ai/Arcee-Spark](https://huggingface.co/arcee-ai/Arcee-Spark)
+* [Weyaxi/Einstein-v7-Qwen2-7B](https://huggingface.co/Weyaxi/Einstein-v7-Qwen2-7B)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+merge_method: model_stock
+base_model: Qwen/Qwen2-7B
+models:
+  - model: Replete-AI/Replete-LLM-Qwen2-7b
+  - model: arcee-ai/Arcee-Spark
+  - model: Weyaxi/Einstein-v7-Qwen2-7B
+  - model: MaziyarPanahi/calme-2.8-qwen2-7b
+dtype: bfloat16
+tokenizer_source: base
+```

mergekit_config.yml CHANGED Viewed

@@ -1,7 +1,7 @@
 merge_method: model_stock
 base_model: Qwen/Qwen2-7B
 models:
-  - model: Replete-AI/Replete-LLM-Qwen2-7b_Beta-Preview
   - model: arcee-ai/Arcee-Spark
   - model: Weyaxi/Einstein-v7-Qwen2-7B
   - model: MaziyarPanahi/calme-2.8-qwen2-7b

 merge_method: model_stock
 base_model: Qwen/Qwen2-7B
 models:
+  - model: Replete-AI/Replete-LLM-Qwen2-7b
   - model: arcee-ai/Arcee-Spark
   - model: Weyaxi/Einstein-v7-Qwen2-7B
   - model: MaziyarPanahi/calme-2.8-qwen2-7b

model-00001-of-00004.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3ffa4ab38a97d1c26dc7c600f199323f519d470cff31302244a4585681421920
+size 4970706328

model-00002-of-00004.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a64737fd28fb1b92f7219d8b4761191445d99eb88d5be0f0df235aa41a5da6fe
+size 4932751032

model-00003-of-00004.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c76156caa4600aaeb8b1446fcfe58d5c35efe55ff47cc00051aa46c85c05cc5d
+size 4991495808

model-00004-of-00004.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:321857587f63d61d75337fed4595ef9206d8d0350592690c73b8a9cd54ffdc2b
+size 330326240