Update README.md
README.md
CHANGED
@@ -122,6 +122,11 @@ Here are some results:
 * Scores #6 in CoPa
 * Scores #2 in PiQA
 * Scores #9 in BoolQ
+| Model | Average ⬆️| ARC (25-s) ⬆️ | HellaSwag (10-s) ⬆️ | MMLU (5-s) ⬆️| TruthfulQA (MC) (0-s) ⬆️ | Winogrande (5-s) | GSM8K (5-s) | DROP (3-s) |
+| --- | --- | --- | --- | --- | --- | --- | --- | --- |
+|[mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) | 50.32 | 59.58 | 83.31 | 64.16 | 42.15 | 78.37 | 18.12 | 6.14 |
+| [Intel/neural-chat-7b-v3-1](https://huggingface.co/Intel/neural-chat-7b-v3-1) | 59.0 | 66.21 | 83.64 | 62.37 | 59.65 | 78.14 | 19.56 | 43.84 |
+| [fblgit/juanako-7b-UNA](https://huggingface.co/fblgit/juanako-7b-UNA) | **65.10** | **68.09** | **85.20** | 61.37 | **65.49** | 76.8 | **48.98** | **49.8** |
 
 Many evaluations were performed, but it behaves very balanced in multiple fields. Feel free to submit more evaluation results.
 
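
Review note: the added table includes an Average column next to seven benchmark columns. As a sanity check, the sketch below recomputes each row's average from the scores in the table. It assumes that Average is the unweighted mean of the seven benchmark columns (ARC through DROP), which the commit itself does not state. The names `SCORES`, `REPORTED_AVERAGE`, and `recomputed_averages` are illustrative and not part of the repository.

```python
from statistics import mean

# Benchmark scores copied from the table added in this commit, in column order:
# ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8K, DROP.
SCORES = {
    "mistralai/Mistral-7B-v0.1": [59.58, 83.31, 64.16, 42.15, 78.37, 18.12, 6.14],
    "Intel/neural-chat-7b-v3-1": [66.21, 83.64, 62.37, 59.65, 78.14, 19.56, 43.84],
    "fblgit/juanako-7b-UNA": [68.09, 85.20, 61.37, 65.49, 76.8, 48.98, 49.8],
}

# Values from the table's "Average" column, kept for comparison.
REPORTED_AVERAGE = {
    "mistralai/Mistral-7B-v0.1": 50.32,
    "Intel/neural-chat-7b-v3-1": 59.0,
    "fblgit/juanako-7b-UNA": 65.10,
}


def recomputed_averages(scores):
    """Return the mean score per model.

    Assumption: the Average column is an unweighted mean of all seven
    benchmark columns. The commit does not confirm this.
    """
    return {model: mean(values) for model, values in scores.items()}


if __name__ == "__main__":
    for model, avg in recomputed_averages(SCORES).items():
        reported = REPORTED_AVERAGE[model]
        print(f"{model}: recomputed {avg:.2f}, reported {reported:.2f}, "
              f"difference {avg - reported:+.2f}")
```

A row whose recomputed mean differs from the reported value by more than rounding may be worth re-checking against the original leaderboard entry before merging.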