Commit f7c7127 by tianyuz
1 parent: 2be40b8

Update README.md

Files changed (1)
  1. README.md +1 -16
README.md CHANGED
@@ -50,22 +50,7 @@ The name `youri` comes from the Japanese word [`妖狸/ようり/Youri`](https:/
 
  # Benchmarking
 
- Evaluation experiments suggest that rinna's `youri-7b` series outperforms other open-source Japanese LLMs on Japanese tasks according to our runs.
-
- | Model | Model type | 4-task score | 6-task score | 8-task score |
- | :-- | :-- | :-- | :-- | :-- |
- | rinna/youri-7b-instruction | SFT | 83.88 | 80.93 | 63.63 |
- | rinna/youri-7b-chat | SFT | 78.29 | 78.47 | 62.18 |
- | matsuo-lab/weblab-10b-instruction-sft | SFT | 78.75 | 75.05 | 59.11 |
- | **rinna/youri-7b** | **pre-trained** | **73.32** | **74.58** | **58.87** |
- | stabilityai/japanese-stablelm-instruct-alpha-7b | SFT | 70.10 | 71.32 | 54.71 |
- | elyza/ELYZA-japanese-Llama-2-7b | pre-trained | 71.72 | 69.28 | 53.17 |
- | elyza/ELYZA-japanese-Llama-2-7b-instruct | SFT | 70.57 | 68.12 | 53.14 |
- | stabilityai/japanese-stablelm-base-alpha-7b | pre-trained | 61.03 | 65.83 | 51.05 |
- | matsuo-lab/weblab-10b | pre-trained | 66.33 | 65.58 | 50.74 |
- | meta/llama2-7b | pre-trained | 56.33 | 54.80 | 42.97 |
- | rinna/japanese-gpt-neox-3.6b | pre-trained | 47.20 | 54.68 | 41.80 |
- | rinna/bilingual-gpt-neox-4b | pre-trained | 46.60 | 52.04 | 40.03 |
+ Please refer to [rinna's LM benchmark page](https://rinnakk.github.io/research/benchmarks/lm/index.html).
 
  ---
 