How to do continue-pre-training on the 7B-Instruct model?

#13

by YalunHu - opened Jul 12, 2024

Jul 12, 2024

To improve the code-generation/code-completion ability, I wanna do a continue-pre-training on this instructed version model, how should I make my pre-training data? Just add "<|endoftext|>" token at the end of each chunk of code-text?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment