Hugging Face
Models: 3,962
Sort: Trending
Active filters: dpo
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3 • Text Generation • Updated Jun 8 • 81 • 5
QuantFactory/NeuralDaredevil-8B-abliterated-GGUF • Text Generation • Updated May 29 • 3.83k • 51
mradermacher/Phoenix-GGUF • Updated 9 days ago • 71 • 1
mradermacher/Phoenix-i1-GGUF • Updated 9 days ago • 108 • 1
nvidia/Llama3-70B-DPO-Chat • Updated Jun 14 • 8 • 3
mlabonne/TwinLlama-3.1-8B-DPO • Text Generation • Updated Oct 6 • 138 • 3
v000000/L3.1-Niitorm-8B-DPO-t0.0001 • Text Generation • Updated Oct 3 • 2.58k • 7
v000000/L3.1-Niitorm-8B-DPO-t0.0001-GGUFs-IMATRIX • Updated Oct 6 • 156 • 2
tanliboy/lambda-qwen2.5-14b-dpo-test • Text Generation • Updated Sep 20 • 2.63k • 7
v000000/Qwen2.5-Lumen-14B • Text Generation • Updated Oct 3 • 2.93k • 18
v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno • Text Generation • Updated Sep 30 • 2.83k • 5
QuantFactory/Qwen2.5-Lumen-14B-GGUF • Text Generation • Updated Sep 21 • 214 • 3
mradermacher/Qwen2.5-Lumen-14B-GGUF • Updated Sep 22 • 124 • 4
tanliboy/lambda-qwen2.5-32b-dpo-test • Text Generation • Updated Sep 22 • 2.64k • 4
mradermacher/Qwen2.5-Lumen-14B-i1-GGUF • Updated Sep 22 • 501 • 8
trl-lib/Qwen2-0.5B-DPO • Text Generation • Updated Sep 27 • 83 • 4
HumanLLMs/Human-Like-LLama3-8B-Instruct • Updated Oct 7 • 65 • 2
pbevan11/Mistral-Nemo-MCAI-SFT-DPO-revision-only • Text Generation • Updated Oct 5 • 29 • 1
HumanLLMs/Human-Like-Qwen2.5-7B-Instruct • Updated Oct 7 • 60 • 3
mradermacher/Mistral-Nemo-Instruct-MCAI-SFT-DPO-revision-only-GGUF • Updated 9 days ago • 43 • 1
mradermacher/Mistral-Nemo-Instruct-MCAI-SFT-DPO-revision-only-i1-GGUF • Updated 9 days ago • 107 • 1
mradermacher/Qwen2.5-14B-Wernicke-DPO-GGUF • Updated Oct 25 • 75 • 1
mradermacher/Qwen2.5-14B-Wernicke-DPO-i1-GGUF • Updated Oct 25 • 579 • 3
mradermacher/mistral-7b-dpo-constitutional-ai-GGUF • Updated Oct 31 • 153 • 1
VAGOsolutions/SauerkrautLM-v2-14b-DPO • Updated Nov 7 • 582 • 18
andito/SmolLM2-1.7B-Instruct-F16-GGUF • Updated Oct 31 • 478 • 1
mradermacher/CapybaraHermes-2.5-Mistral-7B-GGUF • Updated Nov 15 • 50 • 1
mradermacher/CapybaraHermes-2.5-Mistral-7B-i1-GGUF • Updated Nov 15 • 518 • 1
mradermacher/Humanish-Qwen2.5-7B-Instruct-GGUF • Updated 9 days ago • 215 • 1
mradermacher/Humanish-Qwen2.5-7B-Instruct-i1-GGUF • Updated 9 days ago • 640 • 1