Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
DarshanDeshpande
/
gemma_2b_social_reasoning_reward_model
like
0
PEFT
Safetensors
trl
reward-trainer
Generated from Trainer
License:
other
Model card
Files
Files and versions
Community
Use this model
main
gemma_2b_social_reasoning_reward_model
Commit History
gemma_2b_social_reasoning_reward_model
ed8e34e
verified
DarshanDeshpande
commited on
Mar 10, 2024
gemma_2b_social_reasoning_reward_model
6f83f02
verified
DarshanDeshpande
commited on
Mar 10, 2024
gemma_2b_social_reasoning_reward_model
7223345
verified
DarshanDeshpande
commited on
Mar 9, 2024
Upload GemmaForSequenceClassification
3bdbd19
verified
DarshanDeshpande
commited on
Mar 9, 2024
Upload GemmaForSequenceClassification
add1c83
verified
DarshanDeshpande
commited on
Mar 8, 2024
Upload GemmaForCausalLM
9f5c0d2
verified
DarshanDeshpande
commited on
Mar 8, 2024
reward_model
764d1f1
verified
DarshanDeshpande
commited on
Mar 7, 2024
initial commit
02e257f
verified
DarshanDeshpande
commited on
Mar 7, 2024