Update README.md
Browse files
README.md
CHANGED
@@ -46,11 +46,11 @@ We evaluate the alignment performance of FLM-2-52B-Instruct-2407 in Chinese acro
|
|
46 |
|
47 |
| Models | Overall | Math. | Logi. | Fund. | Chi. | Open. | Writ. | Role. | Pro. |
|
48 |
| ----------------------- | ------- | ----- | ----- | ----- | ---- | ----- | ----- | ----- | ---- |
|
49 |
-
| gpt-4-1106-preview | 7.58 | 7.39 | 6.83 | 7.69
|
50 |
-
| gpt-4-0613 |
|
51 |
-
| gpt-3.5-turbo-0613 | 5.68
|
52 |
-
| chatglm-turbo | 6.36
|
53 |
-
| FLM-2-52B-Instruct-2407 | 6.23
|
54 |
|
55 |
|
56 |
# Citation
|
|
|
46 |
|
47 |
| Models | Overall | Math. | Logi. | Fund. | Chi. | Open. | Writ. | Role. | Pro. |
|
48 |
| ----------------------- | ------- | ----- | ----- | ----- | ---- | ----- | ----- | ----- | ---- |
|
49 |
+
| gpt-4-1106-preview | **7.58** | **7.39** | **6.83** | **7.69** |<u>7.07</u>| **8.66** | **8.23** | **8.08** | **8.55** |
|
50 |
+
| gpt-4-0613 | <u>6.83</u> |<u>6.33</u>|<u>5.15</u>| 7.16 | 6.76 | 7.26 | 7.31 | 7.48 | 7.56 |
|
51 |
+
| gpt-3.5-turbo-0613 | 5.68 | 4.90 | 4.79 | 6.01 | 5.60 | 6.97 | 7.27 | 6.98 | 6.29 |
|
52 |
+
| chatglm-turbo | 6.36 | 4.88 | 5.09 |<u>7.50</u>| 7.03 |<u>8.45</u>| 8.05 | 7.67 | 7.70 |
|
53 |
+
| FLM-2-52B-Instruct-2407 | 6.23 | 3.79 |<u>5.15</u>| **7.69** | **7.86** |<u>8.45</u>|<u>8.17</u>|<u>7.88</u>|<u>7.85</u>|
|
54 |
|
55 |
|
56 |
# Citation
|