nnpy
/

Nape-0

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

nnpy commited on Nov 14, 2023

Commit

47e07bd

•

1 Parent(s): c76d232

Update README.md

Files changed (1) hide show

README.md +24 -2

README.md CHANGED Viewed

@@ -1,6 +1,28 @@
 ---
 license: mit
 ---
-The base model is [PY007/TinyLlama-1.1B-intermediate-step-715k-1.5T](https://huggingface.co/PY007/TinyLlama-1.1B-intermediate-step-715k-1.5T)
-The model didn't complete training yet.

 ---
+language:
+- en
 license: mit
 ---
+Nape-0
+Nape series are small models that tries to exihibit much capabilities.
+The model is still in training process. This is very early preview.
+You can load it as follows:
+```
+from transformers import LlamaForCausalLM, AutoTokenizer
+tokenizer = AutoTokenizer.from_pretrained("nnpy/Nape-0")
+model = LlamaForCausalLM.from_pretrained("nnpy/Nape-0")
+```
+## Training
+It took 1 days to train 3 epochs on 4x A6000s using native deepspeed.
+```
+assistant role: You are Semica, a helpful AI assistant.
+user: {prompt}
+assistant:
+```