Update modeling_ltgbert.py

#1

When initializing LtgbertForTokenClassification, several LayerNorm modules don't have a weight or bias.
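
For illustration only, here is a minimal sketch of the kind of guard I mean, assuming the init code touches `module.weight` / `module.bias` directly (in PyTorch, a LayerNorm created with `elementwise_affine=False` registers both as `None`). The function name and constants are hypothetical, not the actual modeling_ltgbert.py code:

```python
import torch.nn as nn

def _init_weights(module):
    # Hypothetical init hook: LayerNorms built with elementwise_affine=False
    # have weight/bias set to None, so check before filling them.
    if isinstance(module, nn.LayerNorm):
        if module.weight is not None:
            module.weight.data.fill_(1.0)
        if module.bias is not None:
            module.bias.data.zero_()
    elif isinstance(module, nn.Linear):
        module.weight.data.normal_(mean=0.0, std=0.02)
        if module.bias is not None:
            module.bias.data.zero_()
```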

Also, when using transformers>=4.40, the two Metaspace entries in tokenizer.json need a "prepend_scheme" field, for example:

      {
        "type": "Metaspace",
        "replacement": "▁",
        "add_prefix_space": false,
        "prepend_scheme": "never"
      },
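
For a local copy of the model, one quick (hypothetical) workaround is to patch tokenizer.json in place. The sketch below just walks the JSON and adds the field to every Metaspace entry that lacks it; "never" mirrors the value suggested above, so adjust it if the official fix ends up using a different scheme:

```python
import json

# Load a local copy of the tokenizer config.
with open("tokenizer.json", encoding="utf-8") as f:
    tok = json.load(f)

def patch_metaspace(node):
    # Recursively visit dicts/lists and add "prepend_scheme"
    # to any Metaspace entry that doesn't have it yet.
    if isinstance(node, dict):
        if node.get("type") == "Metaspace" and "prepend_scheme" not in node:
            node["prepend_scheme"] = "never"
        for value in node.values():
            patch_metaspace(value)
    elif isinstance(node, list):
        for item in node:
            patch_metaspace(item)

patch_metaspace(tok)

with open("tokenizer.json", "w", encoding="utf-8") as f:
    json.dump(tok, f, ensure_ascii=False, indent=2)
```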
HPLT org

Hi, thank you very much for reporting these issues! I will look into it more next week. We're still discussing what to do about the Metaspace pretokenizer, since its new behavior might silently break more things: https://huggingface.co/HPLT/hplt_bert_base_en/discussions/1

Thank you @davda54 for the new tokenizer.json in https://huggingface.co/HPLT/hplt_bert_base_ja/commit/3ba81b4d5b8885c06c3a0c8f4c7feb79fefee1cb. What about modeling_ltgbert.py?

davda54 changed pull request status to merged
HPLT org

Hi, I'm really sorry that it took me so long! Thank you once again for your fix; it's now applied to the Japanese BERT as well as to the other HPLT-BERT models :)
