Hugging Face
dicta-il/dictalm-7b
Tags: Text Generation, Transformers, PyTorch, Safetensors, Hebrew, megatron_gpt, custom_code
arxiv: 2309.14568
License: cc-by-4.0
Branch: main
2 contributors
History: 6 commits
Shaltiel, SFconvertbot
Adding `safetensors` variant of this model (#1) | c233431 | verified | 4 months ago
| File | Size | Storage | Last commit | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 1.52 kB | | initial commit | 10 months ago |
| README.md | 3.74 kB | | Update README.md | 10 months ago |
| config.json | 1.01 kB | | Upload 11 files | 10 months ago |
| configuration_megatron_gpt.py | 9.57 kB | | Updated flash attention usage | 10 months ago |
| generation_config.json | 132 Bytes | | Upload 11 files | 10 months ago |
| merges.txt | 1.27 MB | | Upload 11 files | 10 months ago |
| model-00001-of-00002.safetensors | 9.97 GB | LFS | Adding `safetensors` variant of this model (#1) | 4 months ago |
| model-00002-of-00002.safetensors | 950 MB | LFS | Adding `safetensors` variant of this model (#1) | 4 months ago |
| model.safetensors.index.json | 41.1 kB | | Adding `safetensors` variant of this model (#1) | 4 months ago |
| modeling_megatron_gpt.py | 55.1 kB | | Updated flash attention usage | 10 months ago |
| pytorch_model-00001-of-00002.bin | 9.97 GB | LFS, pickle | Upload 11 files | 10 months ago |
| pytorch_model-00002-of-00002.bin | 950 MB | LFS, pickle | Upload 11 files | 10 months ago |
| pytorch_model.bin.index.json | 39.4 kB | | Upload 11 files | 10 months ago |
| special_tokens_map.json | 567 Bytes | | Upload 11 files | 10 months ago |
| tokenizer_config.json | 890 Bytes | | Upload 11 files | 10 months ago |
| vocab.json | 1.88 MB | | Upload 11 files | 10 months ago |

Detected pickle imports (3) in each of the two pytorch_model-*.bin shards: "torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"