cerebras/btlm-3b-8k-base
Tags: Text Generation, Transformers, PyTorch, English, btlm, causal-lm, Cerebras, BTLM, custom_code
Dataset: cerebras/SlimPajama-627B
6 papers
License: apache-2.0
Revision: refs/pr/3
3 contributors
History: 7 commits
Latest commit: rskuzma, "add README" (8015326), 10 months ago
File                       Size         Last commit                Updated
.gitattributes             1.52 kB      initial commit             10 months ago
README.md                  11.7 kB      add README                 10 months ago
config.json                1.24 kB      change mup param names     10 months ago
configuration_btlm.py      7.58 kB      change mup param names     10 months ago
generation_config.json     119 Bytes    add bfloat16 checkpoint    10 months ago
merges.txt                 456 kB       add the tokenizer          10 months ago
modeling_btlm.py           71.2 kB      change mup param names     10 months ago
pytorch_model.bin          5.29 GB      add bfloat16 checkpoint    10 months ago
special_tokens_map.json    99 Bytes     add the tokenizer          10 months ago
tokenizer_config.json      234 Bytes    add the tokenizer          10 months ago
vocab.json                 1.04 MB      add the tokenizer          10 months ago

pytorch_model.bin is stored with LFS and is a pickle file.
Detected Pickle imports (3): "torch._utils._rebuild_tensor_v2", "torch.BFloat16Storage", "collections.OrderedDict"
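
The "Detected Pickle imports" entry for pytorch_model.bin names the globals that the pickle stream references. As a rough illustration of how such a list could be produced, the sketch below walks a pickle's opcodes with the standard library's pickletools module, without unpickling anything. It is not the Hub's actual scanner. The function name list_pickle_imports is hypothetical, the STACK_GLOBAL handling is a simplification that assumes the module and attribute names are the two most recently pushed strings, and it expects a raw pickle byte stream (a checkpoint saved as a zip archive would first need its inner pickle extracted).

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' globals a pickle references.

    Hypothetical helper: it only reads opcodes and never unpickles,
    so no code from the pickle is executed.
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", space-separated.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name come from the stack. Assumption:
            # they are the two most recently pushed strings.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(imports)


if __name__ == "__main__":
    payload = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
    print(list_pickle_imports(payload))
```

Reading opcodes rather than calling pickle.load matters here because loading an untrusted pickle can run arbitrary code, whereas listing its imports does not.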