appvoid/arco-2

appvoid committed on Sep 22, 2024

Commit 4545113 (verified) · 1 Parent(s): 687c6a7

Update README.md

Files changed (1)

README.md CHANGED

@@ -1,42 +1,18 @@
 ---
-base_model:
-- appvoid/arco-original
-- appvoid/arco
 library_name: transformers
 tags:
 - mergekit
 - merge
 ---
-# layout-2
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the passthrough merge method.
-### Models Merged
-The following models were included in the merge:
-* [appvoid/arco-original](https://huggingface.co/appvoid/arco-original)
-* [appvoid/arco](https://huggingface.co/appvoid/arco)
-### Configuration
-The following YAML configuration was used to produce this model:
-```yaml
-slices:
-  - sources:
-    - model: appvoid/arco
-      layer_range: [0, 14]
-  - sources:
-    - model: appvoid/arco-original
-      layer_range: [14, 16]
-merge_method: passthrough
-dtype: float16
-```

+# arco 2
+This is a merge of arco with an experimental model. It improves on ARC-Challenge (38.82, up from 37.29 for arco and 38.23 for the original arco), falling only about 2 points short of modern 3b baseline performance.
+| Parameters | Model           | MMLU      | ARC-C     | HellaSwag | PIQA      | Winogrande | Average   |
+|------------|-----------------|-----------|-----------|-----------|-----------|------------|-----------|
+| 0.5b       | qwen2           | 44.13     | 28.92     | 49.05     | 69.31     | 56.99      | 49.68     |
+| 0.5b       | arco (original) | 24.41     | 38.23     | 59.21     | 74.27     | 59.59      | 51.14     |
+| 0.5b       | qwen2.5         | **47.29** | 31.83     | 52.17     | 70.29     | 57.06      | 51.72     |
+| 0.5b       | arco            | 26.17     | 37.29     | 62.88     | 74.37     | **62.27**  | 52.60     |
+| 0.5b       | arco 2          | 25.51     | **38.82** | **63.02** | **74.70** | 61.25      | **52.66** |
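
The Average column in the table above appears to be the unweighted mean of the five benchmark scores. The sketch below recomputes it from the listed values; the `scores` dictionary and the `average` helper are illustrative names, not part of the repository, and the unweighted-mean assumption is inferred from the numbers rather than stated by the author.

```python
# Scores copied from the table, in column order:
# MMLU, ARC-C, HellaSwag, PIQA, Winogrande.
scores = {
    "qwen2": [44.13, 28.92, 49.05, 69.31, 56.99],
    "arco (original)": [24.41, 38.23, 59.21, 74.27, 59.59],
    "qwen2.5": [47.29, 31.83, 52.17, 70.29, 57.06],
    "arco": [26.17, 37.29, 62.88, 74.37, 62.27],
    "arco 2": [25.51, 38.82, 63.02, 74.70, 61.25],
}


def average(values):
    """Unweighted mean (assumed to be how the Average column was derived)."""
    return sum(values) / len(values)


averages = {model: average(vals) for model, vals in scores.items()}

# ARC-C is the second column (index 1); gain of arco 2 over arco.
arc_gain = scores["arco 2"][1] - scores["arco"][1]

for model, avg in averages.items():
    print(f"{model:16s} {avg:.2f}")
print(f"ARC-C gain of arco 2 over arco: {arc_gain:.2f}")
```

Under this assumption the recomputed means match the table to within 0.01. The qwen2.5 row comes out at about 51.73 against the listed 51.72, which may reflect truncation or rounding of the underlying scores.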