aisingapore/sea-lion-3b
Tags: Text Generation, Transformers, Safetensors, 11 languages, mpt, custom_code, text-generation-inference
arXiv: 2101.09635
License: mit
Files and versions
Branch: main
4 contributors
History: 47 commits
Latest commit: "Update generation_config.json" by RaymondAISG (4c4edab, verified), about 1 month ago
File                      Size           Updated             Last commit message
.gitattributes            1.52 kB        7 months ago        initial commit
LICENSE                   1.06 kB        7 months ago        Update LICENSE
README.md                 6.38 kB        about 2 months ago  Update README.md
adapt_tokenizer.py        1.72 kB        7 months ago        Add 3B model files
attention.py              21.6 kB        7 months ago        Add 3B model files
blocks.py                 2.84 kB        7 months ago        Add 3B model files
config.json               1.27 kB        7 months ago        Add 3B model files
configuration_mpt.py      11 kB          7 months ago        Add 3B model files
custom_embedding.py       292 Bytes      7 months ago        Add 3B model files
fc.py                     167 Bytes      7 months ago        Add 3B model files
ffn.py                    1.75 kB        7 months ago        Add 3B model files
flash_attn_triton.py      28.2 kB        7 months ago        Add 3B model files
generation_config.json    91 Bytes       about 1 month ago   Update generation_config.json
hf_prefixlm_converter.py  11.4 kB        6 months ago        Update codes to be in line with LLM-foundry update on October 30, 2023
meta_init_context.py      3.96 kB        7 months ago        Add 3B model files
model.safetensors         6.36 GB (LFS)  7 months ago        Add 3B model files
modeling_mpt.py           24.2 kB        7 months ago        Add 3B model files
norm.py                   3.12 kB        7 months ago        Add 3B model files
param_init_fns.py         11.9 kB        7 months ago        Add 3B model files
special_tokens_map.json   59 Bytes       7 months ago        Add 3B model files
tokenization_SEA_BPE.py   7.8 kB         7 months ago        Add 3B model files
tokenizer.model           4.57 MB (LFS)  about 1 month ago   Update tokenizer.model for GGUF quantization
tokenizer_config.json     795 Bytes      7 months ago        Add 3B model files