jmeadows17 committed
Commit fe95774
1 Parent(s): 85265ae

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -5,11 +5,11 @@ library_name: peft
 
 **Overview**
 
-DerivationGeneration8B is a QLoRA fine-tuned from a quantised LLaMa-3.1-8B checkpoint on 15K (LaTeX) synthetic mathematical derivations (containing 4 - 10 equations) via a custom early stopping script using ROUGE as the validation metric (total 6 epochs). This approach outperforms [MathT5](https://huggingface.co/jmeadows17/MathT5-large) in both in-distribution and perturbed evaluation cases presented in related work ```https://arxiv.org/abs/2307.09998```.
+DerivationGeneration8B is QLoRA fine-tuned from a quantised LLaMa-3.1-8B checkpoint on 15K synthetic mathematical derivations in LaTeX (each containing 4-10 equations), using a custom [script](https://github.com/jmeadows17/deriving-equations-with-LLMs/blob/main/llama_3.1_qlora_sft.py) with ROUGE as the validation metric for early stopping (6 epochs in total). This approach outperforms [MathT5](https://huggingface.co/jmeadows17/MathT5-large) on both the in-distribution and perturbed evaluation cases presented in [related work](https://arxiv.org/abs/2307.09998).
 
 **How to use**
 
-A notebook for inference is available [here](https://github.com/jmeadows17/deriving-equations-with-LLMs/blob/main/llama_evaluation.ipynb). Training scripts are also available in the repository.
+A notebook for inference is available [here](https://github.com/jmeadows17/deriving-equations-with-LLMs/blob/main/llama_evaluation.ipynb).
 
 **Example prompt**
 
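On the training side, the card's mention of ROUGE-driven early stopping can be illustrated with the standard `Trainer` + `EarlyStoppingCallback` pattern from `transformers`. This is a generic sketch under assumed names, not the linked `llama_3.1_qlora_sft.py` script; the ROUGE variant, the patience value, and the `model`/`train_ds`/`val_ds` placeholders are illustrative only.

```python
import numpy as np
import evaluate
from transformers import AutoTokenizer, Trainer, TrainingArguments, EarlyStoppingCallback

rouge = evaluate.load("rouge")  # needs the `rouge_score` package

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B")  # assumed base id
tokenizer.pad_token = tokenizer.eos_token  # Llama tokenizers ship without a pad token

def preprocess_logits_for_metrics(logits, labels):
    # Keep only predicted token ids so evaluation does not hold full logits in memory.
    return logits.argmax(dim=-1)

def compute_metrics(eval_pred):
    # Scores teacher-forced token predictions against references -- a cheap proxy,
    # not free-running generation.
    preds, labels = eval_pred
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    pred_text = tokenizer.batch_decode(preds, skip_special_tokens=True)
    ref_text = tokenizer.batch_decode(labels, skip_special_tokens=True)
    return {"rouge2": rouge.compute(predictions=pred_text, references=ref_text)["rouge2"]}

args = TrainingArguments(
    output_dir="derivation-qlora",
    num_train_epochs=6,               # "6 epochs in total" from the card
    eval_strategy="epoch",
    save_strategy="epoch",
    load_best_model_at_end=True,
    metric_for_best_model="rouge2",   # assumed ROUGE variant
    greater_is_better=True,
)

# `model` (quantised base + LoRA adapters) and the tokenised `train_ds` / `val_ds`
# stand in for the QLoRA setup described in the card.
trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_ds,
    eval_dataset=val_ds,
    compute_metrics=compute_metrics,
    preprocess_logits_for_metrics=preprocess_logits_for_metrics,
    callbacks=[EarlyStoppingCallback(early_stopping_patience=2)],  # assumed patience
)
trainer.train()
```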
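For inference, the linked notebook is the authoritative reference. As a rough, non-authoritative sketch of how a PEFT adapter like this is typically loaded and queried (the adapter id `jmeadows17/DerivationGeneration8B`, the base checkpoint id `meta-llama/Meta-Llama-3.1-8B`, and the generation settings below are assumptions, not taken from the model card):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

BASE_ID = "meta-llama/Meta-Llama-3.1-8B"          # assumed base checkpoint id
ADAPTER_ID = "jmeadows17/DerivationGeneration8B"  # assumed adapter repo id

# Load the base model in 4-bit, mirroring the quantised setup the card describes.
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
base = AutoModelForCausalLM.from_pretrained(BASE_ID, quantization_config=bnb, device_map="auto")

# Attach the fine-tuned LoRA weights on top of the quantised base.
model = PeftModel.from_pretrained(base, ADAPTER_ID)
model.eval()

prompt = "..."  # a derivation prompt in the format shown under "Example prompt"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=512, do_sample=False)
# Strip the prompt tokens and decode only the generated continuation.
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

Greedy decoding (`do_sample=False`) is only a placeholder default here; the notebook's generation settings should take precedence.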