Update README.md
Browse files
README.md
CHANGED
@@ -22,7 +22,7 @@ Qwen-0.5B-GRPO is designed to serve as a lightweight math reasoning assistant. B
|
|
22 |
- **Generation Engine:** Utilizes vLLM for faster inference on a single GPU setup
|
23 |
- **Precision:** BF16 training for efficiency on Colab GPUs
|
24 |
|
25 |
-
- **Developed by:**
|
26 |
- **License:** Please refer to the license of the base model on its Hugging Face Hub page
|
27 |
|
28 |
### Model Sources
|
|
|
22 |
- **Generation Engine:** Utilizes vLLM for faster inference on a single GPU setup
|
23 |
- **Precision:** BF16 training for efficiency on Colab GPUs
|
24 |
|
25 |
+
- **Developed by:** Davut Emre Taşar
|
26 |
- **License:** Please refer to the license of the base model on its Hugging Face Hub page
|
27 |
|
28 |
### Model Sources
|