torchtorchkimtorch's picture
Update README.md
9e6a514 verified
|
raw
history blame
6.49 kB
---
language:
- ko
- en
base_model:
- meta-llama/Llama-3.2-1B-Instruct
---
> @ 2024.10.07 Model [torchtorchkimtorch/Llama-3.2-Korean-GGACHI-1B-Instruct-v1](https://huggingface.co./torchtorchkimtorch/Llama-3.2-Korean-GGACHI-1B-Instruct-v1) Released!
> @ 2024.10.18 Performance for KOBEST of Llama-3.2-Korean-GGACHI-1B-Instruct-v1 has been updated!
> @ Announcements) Llama-3.2-Korean-GGACHI-1B-Instruct-v2 is set to be released soon.
# **GGACHI-1B-version1** #
![Image Description](๊นŒ์น˜.png)
## ๋ชจ๋ธ ์„ค๋ช… (Model Description)
GGACHI-1B-Instruct-v1๋Š” Llama-3.2-1B-Instruct ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” ํ•œ๊ตญ์–ด ํƒœ์Šคํฌ ์ˆ˜ํ–‰์— ์ตœ์ ํ™”๋œ instruction-tuned ์–ธ์–ด ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. 230,000๊ฐœ ์ด์ƒ์˜ ๊ณ ํ’ˆ์งˆ ํ•œ๊ตญ์–ด ๋ฐ์ดํ„ฐ์…‹์„ ์‚ฌ์šฉํ•˜์—ฌ fine-tuning๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
GGACHI-1B-Instruct-v1 is an instruction-tuned language model optimized for Korean language tasks, based on the Llama-3.2-1B-Instruct model. It has been fine-tuned using over 230,000 high-quality Korean language datasets.
## ๋ชจ๋ธ ์„ฑ๋Šฅ (Model Performance)
#### - 0 shot ####
<table style="width:100%; text-align:center; border-collapse:collapse;">
<thead>
<tr>
<th style="border:1px solid black;">Task</th>
<th style="border:1px solid black;">Model</th>
<th style="border:1px solid black;">Accuracy</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_boolq</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;"><strong>0.502</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.502</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_copa</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.504</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.521</strong></td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_hellaswag</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.358</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.380</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_sentineg</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.476</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.594</strong></td>
</tr>
</tbody>
</table>
#### - 5 shot ####
<table style="width:100%; text-align:center; border-collapse:collapse;">
<thead>
<tr>
<th style="border:1px solid black;">Task</th>
<th style="border:1px solid black;">Model</th>
<th style="border:1px solid black;">Accuracy</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_boolq</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;"><strong>0.571</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;">0.565</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_copa</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.526</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.549</strong></td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_hellaswag</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.364</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.398</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_sentineg</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.725</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.795</strong></td>
</tr>
</tbody>
</table>
#### - 10 shot ####
<table style="width:100%; text-align:center; border-collapse:collapse;">
<thead>
<tr>
<th style="border:1px solid black;">Task</th>
<th style="border:1px solid black;">Model</th>
<th style="border:1px solid black;">Accuracy</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_boolq</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;"><strong>0.593</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;">0.571</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_copa</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.525</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.549</strong></td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_hellaswag</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.356</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.394</td>
</tr>
<tr>
<td rowspan="2" style="border:1px solid black;">kobest_sentineg</td>
<td style="border:1px solid black;">Llama-3.2-1B-Instruct</td>
<td style="border:1px solid black;">0.768</td>
</tr>
<tr>
<td style="border:1px solid black;"><strong>GGACHI</strong></td>
<td style="border:1px solid black;"><strong>0.821</strong></td>
</tr>
</tbody>
</table>