Pre-training code
#47
by
bilibraker
- opened
Thank you for the awesome model!
Is there a plan to release a pre-training script similar to LLaVA?
Thanks!
We will not open the codebase as it is now complex and consistently changing, but it follows closely the implementation in Transformers, and we use DeepSpeed Zero 3 for the model parallelization.
HugoLaurencon
changed discussion status to
closed