
A Llama-format version of Nanbeige/Nanbeige-16B-Base that can be loaded with LlamaForCausalLM.
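As a rough illustration of loading through `LlamaForCausalLM`, the sketch below uses the standard `transformers` API. Several things here are assumptions rather than statements from this card: `MODEL_PATH` is a hypothetical placeholder (substitute the actual repo id or local directory of this checkpoint), the tokenizer is assumed to load through `AutoTokenizer`, and BF16 is assumed as the load dtype. `device_map="auto"` additionally requires the `accelerate` package.

```python
# Hypothetical placeholder: this card does not state the repo id of the
# Llama-format checkpoint, so replace this with the real repo id or local path.
MODEL_PATH = "path/to/nanbeige-16b-base-llama"


def load_model(model_path: str = MODEL_PATH):
    """Load the tokenizer and weights using the stock Llama model class."""
    # Imported lazily so this module can be imported without the heavy deps.
    import torch
    from transformers import AutoTokenizer, LlamaForCausalLM

    # Assumption: the tokenizer files in the checkpoint work with AutoTokenizer.
    tokenizer = AutoTokenizer.from_pretrained(model_path)
    model = LlamaForCausalLM.from_pretrained(
        model_path,
        torch_dtype=torch.bfloat16,  # assumption: load the weights as BF16
        device_map="auto",           # needs `accelerate`; spreads layers over devices
    )
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 32) -> str:
    """Continue `prompt` with the base model and return the decoded text."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Some prompt text")` would download or read the checkpoint and return a continuation. Since this is a base model, plain text completion is likely a better fit than chat-style prompting.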

Nanbeige-16B is a 16-billion-parameter language model developed by Nanbeige LLM Lab. It was pre-trained on 2.5T tokens. The training data includes a large amount of high-quality internet text, a variety of books, code, and other sources. It has achieved good results on a range of established evaluation datasets.

Safetensors · Model size: 15.8B params · Tensor type: F32, BF16