
fix sequence length in santacoder and introduce new model type

#23

Adds a new `model_type` to the config. Currently it is `gpt2`, which creates problems with huggingface/optimum.
Also fixes a sequence-length bug that does not appear in transformers but does in ONNX: transformers' `generate` method passes `position_ids` itself, whereas when running with ONNX the model needs to infer them on its own.
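For context, the usual way to infer `position_ids` when they are not passed in is to derive them from the attention mask, as transformers' GPT-2 does in `prepare_inputs_for_generation`. A minimal sketch of that inference (function name is illustrative, not from this PR):

```python
import torch

def infer_position_ids(attention_mask: torch.Tensor) -> torch.Tensor:
    """Derive 0-based position ids from a (batch, seq_len) attention mask,
    so left-padded sequences start counting at their first real token."""
    # Cumulative sum gives 1-based positions over unmasked tokens; shift to 0-based.
    position_ids = attention_mask.long().cumsum(-1) - 1
    # Padded positions get a dummy in-range value (GPT-2 in transformers uses 1).
    position_ids.masked_fill_(attention_mask == 0, 1)
    return position_ids
```

Inside the exported ONNX graph there is no `generate` loop to supply these ids, so the model must compute something equivalent itself; getting this wrong silently shifts position embeddings for padded batches.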

mayank-mishra changed pull request status to open