weqweasdas committed
Commit fc1057c
1 Parent(s): 028c23e

Update README.md

Files changed (1): README.md (+2 −3)
README.md CHANGED
@@ -8,7 +8,6 @@
 
 The reward model is trained from the base model [google/gemma-7b-it](https://huggingface.co/google/gemma-7b-it).
 
-The training process is identical to [RM-Gemma-7B](https://huggingface.co/weqweasdas/RM-Gemma-7B) but with a max-length of 4096 thanks to more GPU resources.
 
 ## Model Details
 
@@ -48,11 +47,11 @@ We train the model for one epoch with a learning rate of 5e-6, batch size 256, c
 
 ```python
 from transformers import AutoTokenizer, pipeline
-rm_tokenizer = AutoTokenizer.from_pretrained("weqweasdas/RM-Gemma-7B-4096")
+rm_tokenizer = AutoTokenizer.from_pretrained("weqweasdas/RM-Gemma-7B")
 device = 0 # accelerator.device
 rm_pipe = pipeline(
     "sentiment-analysis",
-    model="weqweasdas/RM-Gemma-7B-4096",
+    model="weqweasdas/RM-Gemma-7B",
     #device="auto",
     device=device,
     tokenizer=rm_tokenizer,
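The snippet in the diff is truncated mid-call. A minimal sketch of how the full scoring flow might look follows; the pipeline kwargs, the example chat, and the `extract_scores` helper are assumptions modeled on common reward-model usage with `transformers`, not taken verbatim from this README.

```python
# Hedged sketch completing the truncated snippet above; model names follow
# the post-commit README. Running the __main__ block downloads a ~7B model.

def extract_scores(pipe_outputs):
    """Pull the raw reward score out of each pipeline output.

    With return_all_scores=True, each output is a list with one dict per
    label; a reward model has a single label, so we take index 0.
    """
    return [out[0]["score"] for out in pipe_outputs]


if __name__ == "__main__":
    # Heavy imports live here so the helper above stays dependency-free.
    from transformers import AutoTokenizer, pipeline

    rm_tokenizer = AutoTokenizer.from_pretrained("weqweasdas/RM-Gemma-7B")
    device = 0  # accelerator.device
    rm_pipe = pipeline(
        "sentiment-analysis",
        model="weqweasdas/RM-Gemma-7B",
        # device="auto",
        device=device,
        tokenizer=rm_tokenizer,
    )
    pipe_kwargs = {
        "return_all_scores": True,    # keep the per-label score dicts
        "function_to_apply": "none",  # raw logit, no sigmoid/softmax
        "batch_size": 1,
    }

    # Format a conversation with the tokenizer's chat template before scoring.
    chat = [
        {"role": "user", "content": "Hello!"},
        {"role": "assistant", "content": "Hi there, how can I help?"},
    ]
    text = rm_tokenizer.apply_chat_template(chat, tokenize=False)
    print(extract_scores(rm_pipe([text], **pipe_kwargs)))
```

Higher raw scores indicate responses the reward model prefers; scores are only meaningful relative to other responses for the same prompt.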