mosaicml/mosaic-bert-base-seqlen-256

Tags: Fill-Mask · Transformers · PyTorch · English · bert · custom_code
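The Fill-Mask and custom_code tags indicate that this checkpoint ships its own modeling code (bert_layers.py and related files listed below), so loading it through Transformers requires trust_remote_code=True. Below is a minimal loading sketch, assuming the custom code registers with AutoModelForMaskedLM and that the checkpoint reuses the stock bert-base-uncased vocabulary (the file list below contains no tokenizer files); the example sentence is purely illustrative.

```python
# Hedged sketch: loading a custom_code Fill-Mask checkpoint with Transformers.
from transformers import AutoModelForMaskedLM, BertTokenizer, pipeline

repo = "mosaicml/mosaic-bert-base-seqlen-256"

# Assumption: this repo has no tokenizer files, so we fall back to the standard BERT vocabulary.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# trust_remote_code=True lets Transformers import bert_layers.py / configuration_bert.py from the repo.
model = AutoModelForMaskedLM.from_pretrained(repo, trust_remote_code=True)

fill_mask = pipeline("fill-mask", model=model, tokenizer=tokenizer)
print(fill_mask("The capital of France is [MASK]."))
```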
Files and versions
  • 3 contributors
History: 9 commits
daking, kobindra
Create LICENSE (#1)
5137f0d verified about 1 year ago
  • .gitattributes
    1.48 kB
    initial commit about 2 years ago
  • LICENSE
    11.3 kB
    Create LICENSE (#1) about 1 year ago
  • README.md
    13.9 kB
    Clarify how to load model and use ALiBi over 1 year ago
  • bert_layers.py
    47.3 kB
    Upload BertForMaskedLM about 2 years ago
  • bert_padding.py
    6.26 kB
    Upload BertForMaskedLM about 2 years ago
  • config.json
    843 Bytes
    Change attention_probs_dropout_prob to 0.1 so that triton FlashAttention dependencies are avoided over 1 year ago
  • configuration_bert.py
    1.01 kB
    Upload BertForMaskedLM about 2 years ago
  • flash_attn_triton.py
    42.7 kB
    Upload BertForMaskedLM about 2 years ago
  • pytorch_model.bin
    550 MB (LFS)
    Detected Pickle imports (3): torch._utils._rebuild_tensor_v2, collections.OrderedDict, torch.FloatStorage
    Upload BertForMaskedLM about 2 years ago
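The config.json commit message above notes that attention dropout was set to 0.1 so that the triton FlashAttention dependency is avoided, which implies the custom attention in bert_layers.py only takes the flash_attn_triton.py path when attention_probs_dropout_prob is 0.0. The sketch below shows how one might override the config to opt back into that path; it assumes the triton package and a supported GPU are available, and that zero attention dropout is the only switch involved.

```python
# Hedged sketch: zeroing attention dropout to opt back into the triton FlashAttention path.
# Assumption (from the config.json commit message above): the default of 0.1 keeps the model
# on the plain PyTorch attention path and avoids the triton dependency.
from transformers import AutoConfig, AutoModelForMaskedLM

repo = "mosaicml/mosaic-bert-base-seqlen-256"

config = AutoConfig.from_pretrained(repo, trust_remote_code=True)
config.attention_probs_dropout_prob = 0.0  # repo default is 0.1

model = AutoModelForMaskedLM.from_pretrained(repo, config=config, trust_remote_code=True)
```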