Training With EOS
#6 by assafbk - opened
Thanks for all of your great work!
During training, have you encountered a situation where the model predicts the EOS token at the beginning of the response?
And if so, were you able to mitigate this behaviour?
Thanks in advance!
mrm8488 changed discussion status to closed
The model hasn't converged yet. Continue training.
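For anyone who wants to track the behaviour described in the question while training continues, here is a minimal sketch of one possible check. It is not something the model authors describe. The helper name `early_eos_rate` and the EOS id `2` are hypothetical, and it assumes each response holds only the generated token ids, with the prompt already stripped.

```python
from typing import Sequence


def early_eos_rate(responses: Sequence[Sequence[int]], eos_token_id: int) -> float:
    """Return the fraction of responses whose first generated token is EOS."""
    if not responses:
        return 0.0
    # Count responses that are non-empty and begin with the EOS id.
    early = sum(1 for r in responses if len(r) > 0 and r[0] == eos_token_id)
    return early / len(responses)


# Hypothetical generated token ids; 2 stands in for the tokenizer's EOS id.
samples = [[2], [15, 27, 2], [2, 2], [8, 9, 10, 2]]
rate = early_eos_rate(samples, eos_token_id=2)
print(f"early-EOS rate: {rate:.2f}")
```

Logging this rate on a fixed set of prompts at each checkpoint is one way to see whether it falls as training progresses.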