File size: 6,407 Bytes
edee5ed 9e6a514 edee5ed 203c6d0 f13be14 edee5ed 203c6d0 edee5ed 9e6a514 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 |
---
language:
- ko
- en
base_model:
- meta-llama/Llama-3.2-1B-Instruct
---
> @ 2024.10.07 Model [torchtorchkimtorch/Llama-3.2-Korean-GGACHI-1B-Instruct-v1](https://huggingface.co./torchtorchkimtorch/Llama-3.2-Korean-GGACHI-1B-Instruct-v1) Released!
> @ 2024.10.18 Performance for KOBEST of Llama-3.2-Korean-GGACHI-1B-Instruct-v1 has been updated!
# **GGACHI-1B-version1** #
![Image Description](๊น์น.png)
## ๋ชจ๋ธ ์ค๋ช
(Model Description)
GGACHI-1B-Instruct-v1๋ Llama-3.2-1B-Instruct ๋ชจ๋ธ์ ๊ธฐ๋ฐ์ผ๋ก ํ๋ ํ๊ตญ์ด ํ์คํฌ ์ํ์ ์ต์ ํ๋ instruction-tuned ์ธ์ด ๋ชจ๋ธ์
๋๋ค. 230,000๊ฐ ์ด์์ ๊ณ ํ์ง ํ๊ตญ์ด ๋ฐ์ดํฐ์
์ ์ฌ์ฉํ์ฌ fine-tuning๋์์ต๋๋ค.
GGACHI-1B-Instruct-v1 is an instruction-tuned language model optimized for Korean language tasks, based on the Llama-3.2-1B-Instruct model. It has been fine-tuned using over 230,000 high-quality Korean language datasets.
## ๋ชจ๋ธ ์ฑ๋ฅ (Model Performance)
#### - 0 shot ####
<table style="width:100%; text-align:center; border-collapse:collapse;">
<thead>
<tr>
<th style="border:1px solid black;">Task</th>
<th style="border:1px solid black;">Model</th>
<th style="border:1px solid black;">Accuracy</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_boolq</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;"><strong>0.502</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.502</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_copa</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.504</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.521</strong></td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_hellaswag</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.358</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.380</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_sentineg</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.476</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.594</strong></td>
</tr>
</tbody>
</table>
#### - 5 shot ####
<table style="width:100%; text-align:center; border-collapse:collapse;">
<thead>
<tr>
<th style="border:1px solid black;">Task</th>
<th style="border:1px solid black;">Model</th>
<th style="border:1px solid black;">Accuracy</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_boolq</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;"><strong>0.571</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;">0.565</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_copa</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.526</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.549</strong></td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_hellaswag</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.364</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.398</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_sentineg</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.725</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.795</strong></td>
</tr>
</tbody>
</table>
#### - 10 shot ####
<table style="width:100%; text-align:center; border-collapse:collapse;">
<thead>
<tr>
<th style="border:1px solid black;">Task</th>
<th style="border:1px solid black;">Model</th>
<th style="border:1px solid black;">Accuracy</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_boolq</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;"><strong>0.593</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;">0.571</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_copa</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.525</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.549</strong></td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_hellaswag</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.356</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.394</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_sentineg</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.768</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.821</strong></td>
</tr>
</tbody>
</table> |