SebastianBodza committed
Commit a7a4a14 • 1 Parent(s): 1967b9d
Update README.md
README.md CHANGED
@@ -51,6 +51,8 @@ txt = model.generate(**txt,
 eos_token_id=tokenizer.eos_token_id)
 tokenizer.decode(txt[0], skip_special_tokens=True)
 ```
+## Limitations:
+Gradient accumulation led to divergence after a couple of steps. We therefore reduced the block size to 1024 and used two RTX 3090s to get a batch size of 4, which is probably too small to generalize well.
 ## Training:
 Training was based on Llama-X with adaptations of WizardLM's training script and additional adjustments for QLoRA tuning. MPT code from <a href="https://huggingface.co/SebastianBodza/mpt-30B-qlora-multi_GPU">SebastianBodza/mpt-30B-qlora-multi_GPU</a>
 
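The added "Limitations" lines describe the batch setup only in prose. Below is a minimal sketch of how that setup might be expressed with Hugging Face `TrainingArguments`: the block size of 1024 and the effective batch size of 4 across two RTX 3090s come from the diff, while all remaining names and values are placeholders rather than the repository's actual settings.

```python
# Sketch of the batch configuration described in the "Limitations" section.
# Only block_size, the two-GPU setup and the effective batch size of 4 come
# from the README; output_dir and num_train_epochs are placeholders.
from transformers import TrainingArguments

block_size = 1024  # reduced sequence length used when chunking the training data

training_args = TrainingArguments(
    output_dir="./checkpoints",      # placeholder path
    per_device_train_batch_size=2,   # 2 per GPU x 2 RTX 3090 = effective batch size of 4
    gradient_accumulation_steps=1,   # accumulation disabled after it led to divergence
    num_train_epochs=1,              # placeholder
)
```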
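The "Training" line refers to a QLoRA tune built on Llama-X with WizardLM's script, but the script itself is not part of this diff. The following is therefore only a generic sketch of a 4-bit QLoRA setup with transformers, bitsandbytes and peft; the base model name, LoRA rank/alpha and target modules are illustrative assumptions, not values taken from the linked SebastianBodza/mpt-30B-qlora-multi_GPU code.

```python
# Generic 4-bit QLoRA setup (transformers + bitsandbytes + peft).
# Base model, LoRA rank/alpha and target modules are assumptions for illustration.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                    # quantize base weights to 4 bit for QLoRA
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",           # placeholder base model
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16,                                 # assumed rank
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # assumed attention projections
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
```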