-
-
-
-
-
-
Inference status
Active filters:
dpo
CultriX/Lama-DPOlphin-8B-Q4_K_M-GGUF
Text Generation
•
Updated
•
13
•
1
mradermacher/Lama-DPOlphin-8B-GGUF
Updated
•
10
•
1
tsavage68/Na_L3_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
14
tsavage68/Na_L3_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
13
tsavage68/Na_L3_350steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
12
tsavage68/Na_L3_250steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
12
tsavage68/Na_L3_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
15
tsavage68/Na_L3_350steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
10
mradermacher/Lama-DPOlphin-8B-i1-GGUF
Updated
•
160
•
1
tsavage68/Na_M2_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
13
tsavage68/Na_M2_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
12
tsavage68/Na_M2_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
14
tsavage68/Na_M2_200steps_1e6rate_01beta_cSFTDPO
Text Generation
•
Updated
•
13
tsavage68/Na_M2_100steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
11
SongTonyLi/SFT_D1chosenThenDPO_D2a_Instruct_argilla_math_results
Text Generation
•
Updated
•
11
Jatin313/tiny-chatbot-dpo
Updated
NicholasCorrado/zephyr-7b-uf-dpo-2e
Text Generation
•
Updated
•
10
bartowski/TwinLlama-3.1-8B-DPO3-GGUF
Text Generation
•
Updated
•
23
nomadrp/tq-aya101-6langs
Updated
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e
Text Generation
•
Updated
•
39
tsavage68/Na_M2_1000steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
•
12
tsavage68/Na_M2_1000steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
•
14
tsavage68/Na_M2_1000steps_1e8rate_01beta_cSFTDPO
Text Generation
•
Updated
•
14
tsavage68/Na_M2_350steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
•
13
tsavage68/Na_M2_1000steps_1e8rate_05beta_cSFTDPO
Text Generation
•
Updated
•
10
tsavage68/Na_M2_300steps_1e8rate_01beta_cSFTDPO
Text Generation
•
Updated
•
11
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e
Text Generation
•
Updated
•
15
KoNqUeRoR3891/HW2-dpo
Text Generation
•
Updated
•
14
nomadrp/tq-aya101-gt2
Updated
nomadrp/tq-llama3.1-gt3
Updated