Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
anas-awadalla
/
mpt-1b-redpajama-200b-dolly
like
0
Text Generation
Transformers
PyTorch
togethercomputer/RedPajama-Data-1T
mosaic_gpt
custom_code
arxiv:
2302.13971
arxiv:
2205.14135
arxiv:
2108.12409
License:
cc-by-sa-3.0
Model card
Files
Files and versions
Community
2
Train
Use this model
main
mpt-1b-redpajama-200b-dolly
2 contributors
History:
5 commits
anas-awadalla
turn attention_mask to bool in forward pass
f0a13e4
10 months ago
.gitattributes
1.48 kB
initial commit
12 months ago
README.md
4.62 kB
init
12 months ago
attention.py
13.8 kB
init
12 months ago
config.json
1.14 kB
init
12 months ago
configuration_mosaic_gpt.py
8.87 kB
init
12 months ago
generation_config.json
91 Bytes
init
12 months ago
gpt_blocks.py
3.11 kB
init
12 months ago
low_precision_layernorm.py
1.27 kB
init
12 months ago
mosaic_gpt.py
20.4 kB
turn attention_mask to bool in forward pass
10 months ago
param_init_fns.py
15.9 kB
init
12 months ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
5.25 GB
LFS
init
12 months ago
special_tokens_map.json
99 Bytes
init
12 months ago
tokenizer.json
2.11 MB
init
12 months ago
tokenizer_config.json
366 Bytes
init
12 months ago