chaoweihuang committed on
Commit
78a4bb6
1 Parent(s): d3b3fc2

Update README.md

Files changed (1)
  1. README.md +15 -13
README.md CHANGED
@@ -12,9 +12,22 @@ model-index:
  results: []
  ---
 
- Paper: https://huggingface.co/papers/2410.01691.
+ # FactAlign-LLaMA-3-8B
+ This model is aligned with our **FactAlign** framework for improved long-form factuality, from [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).
+
+ For more information, please refer to our paper: [FactAlign: Long-form Factuality Alignment of Large Language Models](https://huggingface.co/papers/2410.01691).
+
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
 
- # kto-mix-14k-lf-response-llama3-f1_100_0.8-fg0.5-fgudw4.0-kto-fg
 
  This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the trl-lib/kto-mix-14k and the chaoweihuang/lf-response-llama3-f1_100_0.8-fg0.5 datasets.
  It achieves the following results on the evaluation set:
@@ -38,17 +51,6 @@ It achieves the following results on the evaluation set:
  - Fg Kl: nan
  - Fg Loss: 0.7625
 
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
 
  ## Training procedure
 