Update modeling_ltgbert.py

#1 by KoichiYasuoka - opened

When initializing LtgbertForTokenClassification, several LayerNorms don't have weight or bias.
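This looks like the weight initialization assuming that every LayerNorm has affine parameters. A minimal sketch of a guarded initializer, assuming some LayerNorms in modeling_ltgbert.py are created with `elementwise_affine=False` (the method body below is illustrative, not the repository's actual code):

    import torch.nn as nn

    def _init_weights(self, module):
        if isinstance(module, nn.Linear):
            module.weight.data.normal_(mean=0.0, std=0.02)
            if module.bias is not None:
                module.bias.data.zero_()
        elif isinstance(module, nn.LayerNorm):
            # LayerNorms created with elementwise_affine=False have
            # weight=None and bias=None, so they must be skipped here.
            if module.weight is not None:
                module.weight.data.fill_(1.0)
            if module.bias is not None:
                module.bias.data.zero_()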

And when using transformers>=4.40, the two Metaspace entries in tokenizer.json need a "prepend_scheme" key, as follows:

      {
        "type": "Metaspace",
        "replacement": "▁",
        "add_prefix_space": false,
        "prepend_scheme": "never"
      },
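Until a fixed tokenizer.json is published, here is a minimal patch sketch in Python (it assumes a local copy of the file; only the key "prepend_scheme" and the value "never" come from the snippet above, the script itself is just an illustration):

    import json

    with open("tokenizer.json", encoding="utf-8") as f:
        tok = json.load(f)

    def add_prepend_scheme(node):
        # Walk the whole config and add "prepend_scheme": "never"
        # to every Metaspace entry that lacks it.
        if isinstance(node, dict):
            if node.get("type") == "Metaspace":
                node.setdefault("prepend_scheme", "never")
            for value in node.values():
                add_prepend_scheme(value)
        elif isinstance(node, list):
            for item in node:
                add_prepend_scheme(item)

    add_prepend_scheme(tok)

    with open("tokenizer.json", "w", encoding="utf-8") as f:
        json.dump(tok, f, ensure_ascii=False, indent=2)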

Hi, thank you very much for reporting these issues! I will look more into it next week. We're still discussing what to do about the Metaspace pretokenizer, its new behavior might silently break more things: https://huggingface.co./HPLT/hplt_bert_base_en/discussions/1

Thank you @davda54 for the new tokenizer.json at https://huggingface.co./HPLT/hplt_bert_base_ja/commit/3ba81b4d5b8885c06c3a0c8f4c7feb79fefee1cb. Well, how about modeling_ltgbert.py?

davda54 changed pull request status to merged

Hi, I'm really sorry that it took me so long! Thank you once again for your fix; it's now applied to the Japanese BERT as well as to the other HPLT-BERT models :)
