Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
bigcode
/
santacoder
like
325
Follow
BigCode
935
Text Generation
Transformers
PyTorch
bigcode/the-stack
code
gpt2
custom_code
Eval Results
text-generation-inference
Inference Endpoints
arxiv:
1911.02150
arxiv:
2207.14255
arxiv:
2301.03988
License:
bigcode-openrail-m
Model card
Files
Files and versions
Community
45
Train
Deploy
Use this model
refs/pr/42
santacoder
17 contributors
History:
55 commits
loubnabnl
HF staff
add note on fim tokens
cad6f98
about 1 year ago
.gitattributes
Safe
1.48 kB
initial commit
almost 2 years ago
README.md
Safe
9.47 kB
add note on fim tokens
about 1 year ago
config.json
Safe
948 Bytes
Update eos_token_id / bos_token_id in config.json (#19)
over 1 year ago
configuration_gpt2_mq.py
Safe
9.47 kB
Update configuration_gpt2_mq.py (#6)
almost 2 years ago
modeling_gpt2_mq.py
Safe
15.1 kB
Update modeling_gpt2_mq.py
almost 2 years ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (4)
"torch.ByteStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
4.6 GB
LFS
iter_0600000
almost 2 years ago
special_tokens_map.json
Safe
138 Bytes
Update tokenizer (#11)
almost 2 years ago
tokenizer.json
Safe
2.08 MB
Update tokenizer (#11)
almost 2 years ago
tokenizer_config.json
Safe
159 Bytes
Switch from PreTrainedTokenizerFast to GPT2TokenizerFast and add eos_token & bos_token (#15)
almost 2 years ago