nnpy committed on
Commit
47e07bd
1 Parent(s): c76d232

Update README.md

  1. README.md +24 -2
README.md CHANGED
@@ -1,6 +1,28 @@
1
  ---
 
 
2
  license: mit
3
  ---
4
- The base model is [PY007/TinyLlama-1.1B-intermediate-step-715k-1.5T](https://huggingface.co/PY007/TinyLlama-1.1B-intermediate-step-715k-1.5T)
5
 
6
- The model didn't complete training yet.
1
  ---
2
+ language:
3
+ - en
4
  license: mit
5
  ---
 
6
 
7
+ Nape-0
8
+
9
+ The Nape series are small models that try to exhibit strong capabilities.
10
+ The model is still in training. This is a very early preview.
11
+
12
+ You can load it as follows:
13
+
14
+ ```
15
+ from transformers import LlamaForCausalLM, AutoTokenizer
16
+ tokenizer = AutoTokenizer.from_pretrained("nnpy/Nape-0")
17
+ model = LlamaForCausalLM.from_pretrained("nnpy/Nape-0")
18
+ ```
19
+
20
+ ## Training
21
+ It took 1 day to train 3 epochs on 4x A6000 GPUs using native DeepSpeed.
22
+
23
+ ```
24
+ assistant role: You are Semica, a helpful AI assistant.
25
+ user: {prompt}
26
+ assistant:
27
+
28
+ ```
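
The prompt template added in this commit can be filled in programmatically. The following is a minimal sketch, not the author's own inference code: the names `PROMPT_TEMPLATE`, `build_prompt`, and `generate_reply` are hypothetical, and the choices to end the prompt at `assistant:` with no trailing whitespace and to generate 64 new tokens are assumptions. The model and tokenizer are loaded the same way the README shows.

```python
# Prompt template as it appears in the README; the exact trailing whitespace
# after "assistant:" is an assumption.
PROMPT_TEMPLATE = (
    "assistant role: You are Semica, a helpful AI assistant.\n"
    "user: {prompt}\n"
    "assistant:"
)


def build_prompt(user_message: str) -> str:
    """Insert the user's message into the README's prompt template."""
    return PROMPT_TEMPLATE.format(prompt=user_message)


def generate_reply(user_message: str, max_new_tokens: int = 64) -> str:
    """Hypothetical helper: load nnpy/Nape-0 and generate a reply.

    transformers is imported here so that build_prompt can be used
    without the library installed or the weights downloaded.
    """
    from transformers import AutoTokenizer, LlamaForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("nnpy/Nape-0")
    model = LlamaForCausalLM.from_pretrained("nnpy/Nape-0")

    inputs = tokenizer(build_prompt(user_message), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the tokens produced after the prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `build_prompt("What is DeepSpeed?")` returns the three-line prompt string. `generate_reply` downloads the checkpoint on first use, and because the model is described as an early preview, its output quality may vary.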