mlabonne/TwinLlama-3.1-8B-DPO
Tags: Text Generation, Transformers, Safetensors, English, llama, text-generation-inference, unsloth, trl, dpo, conversational, Inference Endpoints
License: apache-2.0
Branch: main
1 contributor
History: 15 commits
Latest commit: d7cd5a8 (verified), "Trained with Unsloth", by mlabonne, 13 days ago
File                              Size       LFS  Last commit           Updated
.gitattributes                    1.52 kB         initial commit        22 days ago
README.md                         577 Bytes       Trained with Unsloth  22 days ago
config.json                       926 Bytes       Trained with Unsloth  22 days ago
generation_config.json            230 Bytes       Trained with Unsloth  22 days ago
model-00001-of-00004.safetensors  4.98 GB    LFS  Trained with Unsloth  13 days ago
model-00002-of-00004.safetensors  5 GB       LFS  Trained with Unsloth  13 days ago
model-00003-of-00004.safetensors  4.92 GB    LFS  Trained with Unsloth  13 days ago
model-00004-of-00004.safetensors  1.17 GB    LFS  Trained with Unsloth  13 days ago
model.safetensors.index.json      24 kB           Trained with Unsloth  22 days ago
special_tokens_map.json           454 Bytes       Upload tokenizer      22 days ago
tokenizer.json                    9.09 MB         Upload tokenizer      22 days ago
tokenizer_config.json             50.9 kB         Upload tokenizer      22 days ago
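
As a rough sanity check on the listing, the four safetensors shard sizes can be summed to estimate the total checkpoint size. The sketch below uses only the rounded sizes shown on this page. The 2 bytes per parameter figure is an assumption (16-bit weights) that the listing does not confirm, so the parameter estimate is indicative only.

```python
# Shard sizes in GB, copied from the file listing (values are rounded by the page).
shard_sizes_gb = {
    "model-00001-of-00004.safetensors": 4.98,
    "model-00002-of-00004.safetensors": 5.0,
    "model-00003-of-00004.safetensors": 4.92,
    "model-00004-of-00004.safetensors": 1.17,
}

# Total size of the weight shards.
total_gb = sum(shard_sizes_gb.values())

# Assumption (not stated in the listing): weights are stored in a 16-bit
# format, i.e. 2 bytes per parameter, and 1 GB = 10**9 bytes.
bytes_per_param = 2
approx_params_billion = total_gb / bytes_per_param

print(f"Total shard size: {total_gb:.2f} GB")
print(f"Approximate parameter count at 2 bytes each: {approx_params_billion:.1f}B")
```

Under that assumption the total is in line with the "8B" in the model name.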