Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
alnrg2arg
/
test3_sft_4bit_dpo
like
0
Text Generation
Transformers
Safetensors
Intel/orca_dpo_pairs
English
mistral
text-generation-inference
unsloth
trl
conversational
Inference Endpoints
License:
cc-by-nc-4.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
test3_sft_4bit_dpo
Commit History
Update README.md
a5263a5
verified
alnrg2arg
commited on
Jan 27
Upload MistralForCausalLM
83e6912
verified
alnrg2arg
commited on
Jan 27
Upload tokenizer
7e96266
verified
alnrg2arg
commited on
Jan 27
Upload README.md with huggingface_hub
805a479
verified
alnrg2arg
commited on
Jan 27
initial commit
e5fe6da
verified
alnrg2arg
commited on
Jan 27