appvoid commited on
Commit
4545113
·
verified ·
1 Parent(s): 687c6a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -33
README.md CHANGED
@@ -1,42 +1,18 @@
1
  ---
2
- base_model:
3
- - appvoid/arco-original
4
- - appvoid/arco
5
  library_name: transformers
6
  tags:
7
  - mergekit
8
  - merge
9
 
10
  ---
11
- # layout-2
12
 
13
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
 
15
- ## Merge Details
16
- ### Merge Method
17
-
18
- This model was merged using the passthrough merge method.
19
-
20
- ### Models Merged
21
-
22
- The following models were included in the merge:
23
- * [appvoid/arco-original](https://huggingface.co/appvoid/arco-original)
24
- * [appvoid/arco](https://huggingface.co/appvoid/arco)
25
-
26
- ### Configuration
27
-
28
- The following YAML configuration was used to produce this model:
29
-
30
- ```yaml
31
- slices:
32
- - sources:
33
- - model: appvoid/arco
34
- layer_range: [0, 14]
35
- - sources:
36
- - model: appvoid/arco-original
37
- layer_range: [14, 16]
38
-
39
- merge_method: passthrough
40
- dtype: float16
41
-
42
- ```
 
1
  ---
 
 
 
2
  library_name: transformers
3
  tags:
4
  - mergekit
5
  - merge
6
 
7
  ---
8
+ # arco 2
9
 
10
+ This is a merge of arco with an experimental model. As you can see, it dramatically improved on arc challenge, only missing 2 points to get to the level of modern 3b baseline performance.
11
 
12
+ | Parameters | Model | MMLU | ARC-C | HellaSwag | PIQA | Winogrande | Average |
13
+ | -----------|--------------------------------|-------|-------|-----------|--------|------------|---------|
14
+ | 0.5b | qwen2 |44.13| 28.92| 49.05 | 69.31 | 56.99 | 49.68 |
15
+ | 0.5b | arco (original) |24.41 | 38.23 | 59.21 | 74.27 | 59.59 | 51.14 |
16
+ | 0.5b | qwen2.5 |**47.29**|31.83|52.17|70.29|57.06|51.72|
17
+ | 0.5b | arco |26.17|37.29|62.88|74.37|**62.27**|52.60|
18
+ | 0.5b | arco 2 |25.51|**38.82**|**63.02**|**74.70**|61.25|**52.66**|