Quazim0t0
/

Phi4.Turn.R1Distill_v1.4_Q4_k-GGUF

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Quazim0t0 commited on 15 days ago

Commit

5b6e08a

·

verified ·

1 Parent(s): 2de1730

Update README.md

Files changed (1) hide show

README.md +14 -3

README.md CHANGED Viewed

@@ -9,14 +9,25 @@ tags:
 license: apache-2.0
 language:
 - en
 ---
 # Uploaded  model
 - **Developed by:** Quazim0t0
-- **License:** apache-2.0
 - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 license: apache-2.0
 language:
 - en
+datasets:
+- bespokelabs/Bespoke-Stratos-17k
+- bespokelabs/Bespoke-Stratos-35k
+- NovaSky-AI/Sky-T1_data_17k
+- Quazim0t0/BenfordsLawReasoningJSON
+- open-thoughts/OpenThoughts-114k
 ---
 # Uploaded  model
 - **Developed by:** Quazim0t0
 - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
+- **GGUF**
+- **Trained for 8 Hours on A800 with the Bespoke Stratos 17k Dataset.**
+- **Trained for 6 Hours on A800 with the Bespoke Stratos 35k Dataset.**
+- **Trained for 2 Hours on A800 with the Benford's Law Reasoning Small 430 Row Dataset, ensuring no overfitting.**
+- **Trained for 4 Hours on A800 with the Sky-T1_data_17k Dataset**
+- **Trained for 2 Hours on A800 with the Openthoughts 114k Dataset.**
+- **15$ Training...I'm actually amazed by the results.**
+If using this model for Open WebUI here is a simple function to organize the models responses: https://openwebui.com/f/quaz93/phi4_turn_r1_distill_thought_function_v1