Violet0203 commited on
Commit
c4b6439
โ€ข
1 Parent(s): 9d15ef3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -16,9 +16,9 @@ base_model: KT-AI/midm-bitext-S-7B-inst-v1
16
  - ์ผ๋ฐ˜์ ์œผ๋กœ 1900์Šคํ…์—์„œ๋Š” ์ •ํ™•๋„ accuracy๊ฐ€ 80ํ›„๋ฐ˜๋Œ€(์•ฝ 85%)๊ฐ€ ๋„์ถœ, 2000์Šคํ…์ด์ƒ๋ถ€ํ„ฐ 90%์— ๊ทผ์ ‘ํ•œ ์ˆ˜์น˜๋ฅผ ๋ณด์˜€๋‹ค.
17
  - seq length๋ฅผ 312๋กœ ์ค„์ธ ๊ฒฐ๊ณผ, seq length 384๋ณด๋‹ค ํ›ˆ๋ จ์‹œ๊ฐ„trainer.train์ด ์ ๊ฒŒ ๊ฑธ๋ฆฌ์ง€๋งŒ ์ •ํ™•๋„๋„ ๊ฐ์†Œ
18
  - gradient_accumulation steps์„ 2๋กœ ์„ค์ •ํ•˜์—ฌ ๋ฏธ๋‹ˆ๋ฐฐ์น˜๋ฅผ ํ†ตํ•ด ๊ตฌํ•ด์ง„ gradient๊ฐ’์„ n step๋™์•ˆ
19
- global gradient์— ๋ˆ„์ ์‹œํ‚จ ํ›„ ํ•œ๋ฒˆ์— ์—…๋Žƒ->๋ฐฐ์น˜๋ฅผ ์—ฌ๋Ÿฌ๊ฐœ ์‚ฌ์šฉํ•œ ํšจ๊ณผ๋ฅผ ์ฃผ๋Š” ๋“ฑ ๋…ธ๋ ฅํ•จ.
20
- ##Accuracy ์ •ํ™•๋„ ๋ถ„์„
21
- ###valid_dataset(test dataset 1000๊ฐœ์— ๋Œ€ํ•œ ์ •ํ™•๋„)
22
  *********************************
23
  | | TP | TN |
24
  |:-------------:|:-----:|:----:|
 
16
  - ์ผ๋ฐ˜์ ์œผ๋กœ 1900์Šคํ…์—์„œ๋Š” ์ •ํ™•๋„ accuracy๊ฐ€ 80ํ›„๋ฐ˜๋Œ€(์•ฝ 85%)๊ฐ€ ๋„์ถœ, 2000์Šคํ…์ด์ƒ๋ถ€ํ„ฐ 90%์— ๊ทผ์ ‘ํ•œ ์ˆ˜์น˜๋ฅผ ๋ณด์˜€๋‹ค.
17
  - seq length๋ฅผ 312๋กœ ์ค„์ธ ๊ฒฐ๊ณผ, seq length 384๋ณด๋‹ค ํ›ˆ๋ จ์‹œ๊ฐ„trainer.train์ด ์ ๊ฒŒ ๊ฑธ๋ฆฌ์ง€๋งŒ ์ •ํ™•๋„๋„ ๊ฐ์†Œ
18
  - gradient_accumulation steps์„ 2๋กœ ์„ค์ •ํ•˜์—ฌ ๋ฏธ๋‹ˆ๋ฐฐ์น˜๋ฅผ ํ†ตํ•ด ๊ตฌํ•ด์ง„ gradient๊ฐ’์„ n step๋™์•ˆ
19
+ global gradient์— ๋ˆ„์ ์‹œํ‚จ ํ›„ ํ•œ๋ฒˆ์— ์—…๋Žƒ->๋ฐฐ์น˜๋ฅผ ์—ฌ๋Ÿฌ๊ฐœ ์‚ฌ์šฉํ•œ ํšจ๊ณผ๋ฅผ ์ฃผ๋Š” ๋“ฑ ๋…ธ๋ ฅํ•จ.
20
+ ## Accuracy ์ •ํ™•๋„ ๋ถ„์„
21
+ ##### valid_dataset(test dataset 1000๊ฐœ์— ๋Œ€ํ•œ ์ •ํ™•๋„)
22
  *********************************
23
  | | TP | TN |
24
  |:-------------:|:-----:|:----:|