aisingapore/sea-lion-3b
Tags: Text Generation, Transformers, Safetensors, 11 languages, mpt, custom_code, text-generation-inference, Inference Endpoints
arxiv: 2101.09635
License: mit
Revision: refs/pr/1
4 contributors, History: 32 commits
Latest commit: weiqipedia, "Minor fixes for README.md" (fee5443), 12 months ago
File                      Size       LFS  Last commit                Updated
.gitattributes            1.52 kB         initial commit             12 months ago
LICENSE                   1.06 kB         Update LICENSE             12 months ago
README.md                 4.34 kB         Minor fixes for README.md  12 months ago
adapt_tokenizer.py        1.72 kB         Add 3B model files         12 months ago
attention.py              21.6 kB         Add 3B model files         12 months ago
blocks.py                 2.84 kB         Add 3B model files         12 months ago
config.json               1.27 kB         Add 3B model files         12 months ago
configuration_mpt.py      11 kB           Add 3B model files         12 months ago
custom_embedding.py       292 Bytes       Add 3B model files         12 months ago
fc.py                     167 Bytes       Add 3B model files         12 months ago
ffn.py                    1.75 kB         Add 3B model files         12 months ago
flash_attn_triton.py      28.2 kB         Add 3B model files         12 months ago
generation_config.json    91 Bytes        Add 3B model files         12 months ago
hf_prefixlm_converter.py  27.6 kB         Add 3B model files         12 months ago
meta_init_context.py      3.96 kB         Add 3B model files         12 months ago
model.safetensors         6.36 GB    LFS  Add 3B model files         12 months ago
modeling_mpt.py           24.2 kB         Add 3B model files         12 months ago
norm.py                   3.12 kB         Add 3B model files         12 months ago
param_init_fns.py         11.9 kB         Add 3B model files         12 months ago
special_tokens_map.json   59 Bytes        Add 3B model files         12 months ago
tokenization_SEA_BPE.py   7.8 kB          Add 3B model files         12 months ago
tokenizer.model           4.57 MB    LFS  Add 3B model files         12 months ago
tokenizer_config.json     795 Bytes       Add 3B model files         12 months ago