teknium committed on
Commit
ccbefea
1 Parent(s): 2bb0b75

Update benchmark comparisons to add Openchat and Jackalope

  1. README.md +4 -8
README.md CHANGED
@@ -81,17 +81,13 @@ You are to roleplay as Edward Elric from fullmetal alchemist. You are in the wor
 
 Hermes 2 on Mistral-7B outperforms all Nous & Hermes models of the past, save Hermes 70B, and surpasses most of the current Mistral finetunes across the board.
 
-### GPT4All:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/RjgaKLUNMWK5apNn28G18.png)
+### GPT4All, Bigbench, TruthfulQA, and AGIEval Model Comparisons:
 
-### AGIEval:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/VN4hWrjxABKyC5IJqFR7v.png)
-
-### BigBench:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/uQtCdaoHO7Wrs-eIUB7d8.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Kxq4BFEc-d1kSSiCIExua.png)
 
 ### Averages Compared:
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/e0dq1UDiUPMbtGR96Ax16.png)
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/Q9uexgcbTLcywlYBvORTs.png)
 
 GPT-4All Benchmark Set
 ```