linzaiyun committed
Commit c8de6ef · verified · 1 Parent(s): eea2579

Update README.md

Files changed (1)
  1. README.md +11 -15
README.md CHANGED
@@ -4,6 +4,14 @@ should probably proofread and complete it, then remove this comment. -->
4
 
5
  # ifable/gemma-2-Ifable-9B
6
 
7
 
8
  It achieves the following results on the evaluation set:
9
  - Loss: 1.0163
@@ -15,21 +23,6 @@ It achieves the following results on the evaluation set:
15
  - Logps/chosen: -2.1682
16
  - Logits/rejected: -17.0475
17
  - Logits/chosen: -12.0041
18
- - Sft Loss: 0.0184
19
-
20
- ## Model description
21
-
22
- More information needed
23
-
24
- ## Intended uses & limitations
25
-
26
- More information needed
27
-
28
- ## Training and evaluation data
29
-
30
- More information needed
31
-
32
- ## Training procedure
33
 
34
  ### Training hyperparameters
35
 
@@ -61,3 +54,6 @@ The following hyperparameters were used during training:
61
  - Pytorch 2.3.0a0+ebedce2
62
  - Datasets 2.20.0
63
  - Tokenizers 0.19.1
 
 
 
 
4
 
5
  # ifable/gemma-2-Ifable-9B
6
 
7
+ ## Training and evaluation data
8
+
9
+ - Gutenberg: https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1
10
+ - Carefully curated proprietary creative writing dataset
11
+
12
+ ## Training procedure
13
+
14
+ Training method: SimPO (Simple Preference Optimization with a Reference-Free Reward; see the princeton-nlp/SimPO repository on GitHub)
15
 
16
  It achieves the following results on the evaluation set:
17
  - Loss: 1.0163
 
23
  - Logps/chosen: -2.1682
24
  - Logits/rejected: -17.0475
25
  - Logits/chosen: -12.0041
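
For readers unfamiliar with the SimPO method named in the training procedure, the following is a minimal sketch of the per-example SimPO objective as it is usually stated: the length-normalized log-probability of the chosen response is compared against that of the rejected response, scaled by `beta` and offset by a target margin `gamma`. The function name `simpo_loss` and the default values of `beta` and `gamma` are illustrative assumptions; the README does not state the values used for this model, and this is not the authors' training code.

```python
import math


def simpo_loss(chosen_token_logps, rejected_token_logps, beta=2.0, gamma=1.0):
    """Per-example SimPO loss from per-token log-probabilities.

    beta and gamma are hypothetical defaults chosen for illustration only;
    they are not taken from this model card.
    """
    # Length-normalized (average per-token) log-probabilities act as the
    # reference-free reward for each response.
    avg_chosen = sum(chosen_token_logps) / len(chosen_token_logps)
    avg_rejected = sum(rejected_token_logps) / len(rejected_token_logps)

    # Reward difference, scaled by beta, minus the target margin gamma.
    margin = beta * (avg_chosen - avg_rejected) - gamma

    # -log(sigmoid(margin)) computed in a numerically stable way.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

Under these assumptions, the loss shrinks as the chosen response becomes more likely per token than the rejected one, and it equals `log(2)` when the scaled difference exactly matches the margin.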
26
 
27
  ### Training hyperparameters
28
 
 
54
  - Pytorch 2.3.0a0+ebedce2
55
  - Datasets 2.20.0
56
  - Tokenizers 0.19.1
57
+
58
+
59
+ We are looking for product managers and operations managers to build applications with our model, and for AI engineers to join us. We are also open to business cooperation. Contact: [email protected]