BramVanroy committed
Update README.md

README.md CHANGED
@@ -30,6 +30,22 @@ This model is a fine-tuned version of [Rijgersberg/GEITje-7B](https://huggingfac
 > Note that this model has not been aligned with DPO or other techniques. In practice, it is therefore recommended to use the [DPO variant](https://huggingface.co/BramVanroy/GEITje-7B-ultra) of this model.
 
 
+## Citation
+
+If you use GEITje 7B Ultra (SFT) or any of its derivatives or quantizations, please cite the following paper:
+
+```bibtex
+@misc{vanroy2024geitje7bultraconversational,
+      title={GEITje 7B Ultra: A Conversational Model for Dutch},
+      author={Bram Vanroy},
+      year={2024},
+      eprint={2412.04092},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2412.04092},
+}
+```
+
 ## Model description
 
 This model is an SFT (chat-tuned) version of [Rijgersberg/GEITje-7B](https://huggingface.co/Rijgersberg/GEITje-7B), which in turn is based on Mistral 7B and further pretrained on Dutch data.