mosaicml/mosaic-bert-base-seqlen-256

Tags: Fill-Mask · Transformers · PyTorch · English · bert · custom_code
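The Fill-Mask and custom_code tags indicate that this checkpoint ships its own modeling code (bert_layers.py and related files listed below), so loading it through Transformers requires trust_remote_code=True. Below is a minimal loading sketch, assuming the custom code registers with AutoModelForMaskedLM and that the checkpoint reuses the stock bert-base-uncased vocabulary (the file list below contains no tokenizer files); the example sentence is purely illustrative.

```python
# Hedged sketch: loading a custom_code Fill-Mask checkpoint with Transformers.
from transformers import AutoModelForMaskedLM, BertTokenizer, pipeline

repo = "mosaicml/mosaic-bert-base-seqlen-256"

# Assumption: this repo has no tokenizer files, so we fall back to the standard BERT vocabulary.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# trust_remote_code=True lets Transformers import bert_layers.py / configuration_bert.py from the repo.
model = AutoModelForMaskedLM.from_pretrained(repo, trust_remote_code=True)

fill_mask = pipeline("fill-mask", model=model, tokenizer=tokenizer)
print(fill_mask("The capital of France is [MASK]."))
```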
Files and versions
  • 3 contributors
History: 9 commits
daking, kobindra
Create LICENSE (#1)
5137f0d verified about 1 year ago
  • .gitattributes
    1.48 kB
    initial commit about 2 years ago
  • LICENSE
    11.3 kB
    Create LICENSE (#1) about 1 year ago
  • README.md
    13.9 kB
    Clarify how to load model and use ALiBi over 1 year ago
  • bert_layers.py
    47.3 kB
    Upload BertForMaskedLM about 2 years ago
  • bert_padding.py
    6.26 kB
    Upload BertForMaskedLM about 2 years ago
  • config.json
    843 Bytes
    Change attention_probs_dropout_prob to 0.1 so that triton FlashAttention dependencies are avoided over 1 year ago
  • configuration_bert.py
    1.01 kB
    Upload BertForMaskedLM about 2 years ago
  • flash_attn_triton.py
    42.7 kB
    Upload BertForMaskedLM about 2 years ago
  • pytorch_model.bin
    550 MB (LFS)
    Detected Pickle imports (3): torch._utils._rebuild_tensor_v2, collections.OrderedDict, torch.FloatStorage
    Upload BertForMaskedLM about 2 years ago
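The config.json commit message above notes that attention dropout was set to 0.1 so that the triton FlashAttention dependency is avoided, which implies the custom attention in bert_layers.py only takes the flash_attn_triton.py path when attention_probs_dropout_prob is 0.0. The sketch below shows how one might override the config to opt back into that path; it assumes the triton package and a supported GPU are available, and that zero attention dropout is the only switch involved.

```python
# Hedged sketch: zeroing attention dropout to opt back into the triton FlashAttention path.
# Assumption (from the config.json commit message above): the default of 0.1 keeps the model
# on the plain PyTorch attention path and avoids the triton dependency.
from transformers import AutoConfig, AutoModelForMaskedLM

repo = "mosaicml/mosaic-bert-base-seqlen-256"

config = AutoConfig.from_pretrained(repo, trust_remote_code=True)
config.attention_probs_dropout_prob = 0.0  # repo default is 0.1

model = AutoModelForMaskedLM.from_pretrained(repo, config=config, trust_remote_code=True)
```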