ifable
/

gemma-2-Ifable-9B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

linzaiyun commited on Sep 12, 2024

Commit

c8de6ef

·

verified ·

1 Parent(s): eea2579

Update README.md

Files changed (1) hide show

README.md +11 -15

README.md CHANGED Viewed

@@ -4,6 +4,14 @@ should probably proofread and complete it, then remove this comment. -->
 # ifable/gemma-2-Ifable-9B
 It achieves the following results on the evaluation set:
 - Loss: 1.0163
@@ -15,21 +23,6 @@ It achieves the following results on the evaluation set:
 - Logps/chosen: -2.1682
 - Logits/rejected: -17.0475
 - Logits/chosen: -12.0041
-- Sft Loss: 0.0184
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -61,3 +54,6 @@ The following hyperparameters were used during training:
 - Pytorch 2.3.0a0+ebedce2
 - Datasets 2.20.0
 - Tokenizers 0.19.1

 # ifable/gemma-2-Ifable-9B
+## Training and evaluation data
+- Gutenberg: https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1
+- Carefully curated proprietary creative writing dataset
+## Training procedure
+Training method: SimPO (GitHub - princeton-nlp/SimPO: SimPO: Simple Preference Optimization with a Reference-Free Reward)
 It achieves the following results on the evaluation set:
 - Loss: 1.0163
 - Logps/chosen: -2.1682
 - Logits/rejected: -17.0475
 - Logits/chosen: -12.0041
 ### Training hyperparameters
 - Pytorch 2.3.0a0+ebedce2
 - Datasets 2.20.0
 - Tokenizers 0.19.1
+We are looking for product manager and operations maganers to build applications through our model, and also open for business cooperation, and also AI engineer to join us, contact with : [email protected]